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Lorentz-violating light rays in a weak gravitational background by implementing the principle 
of minimal coupling. Thereby it is shown how Lorentz violation in the photon sector can 
be constrained by measurements of light bending at massive bodies such as the Sun. The 
phenomenological studies are based on the currently running ESA mission GAIA and the 
planned NASA/ESA mission LATOR. The final part of the paper discusses certain aspects 
of explicit Lorentz violation in gravity based on the setting of Finsler geometry. 

PACS numbers: ll.30.Cp, ll.80.Fv, 42.15.-i, 02.40.-k 

Keywords: Lorentz violation; Eikonal approximation; Geometrical optics; Differential geometry 


mschreck@indiana.edu 




2 


I. INTRODUCTION 

During the past 15 years plenty of progress has been made in understanding CPT- and Lorentz 
violation and its possible implications on physics from both a theoretical and a phenomenological 
point of view. This was made possible by establishing the Standard-Model Extension (SME) in 
1998 [I] and by the subsequent tireless work of people in our community eager to study imprints of 
Planck-scale physics detectable by experiments operating at much smaller energies. The SME is a 
powerful framework incorporating all Lorentz-violating operators into the Standard Model of ele¬ 
mentary particles and General Relativity. It neither modihes the gauge structure of the Standard 
Model nor does it introduce new particles. The power-counting renormalizable contributions of 
the SME are grouped into its minimal part where the remaining higher-order operators comprise 
the nonminimal SME ISHH. This framework allows for astounding experimental tests of Lorentz 
invariance where even presently some experiments reach a sensitivity of the Planck scale square 
(see [5] for a yearly updated compilation of experimental contraints on Lorentz-violating coeffi¬ 
cients). Since Lorentz violation implies UP T-violation according to a theorem by Greenberg [S], 
the Standard-Model Extension involves all GPT-odd operators as a subset. Note that Lorentz 
violation has been predicted by various prototypes of fundamental theories such as string theory 
|M], loop quantum gravity mm noncommutative spacetime mm^ spacetime foam [HHE], 
and models with nontrivial spacetime topology [laiiH]. 

In the recent past profound studies of modified quantum field theories based on the SME were 
performed at tree-level and including quantum corrections. The result of these studies is that most 
sectors are free of any inconsistencies |19H32] . Furthermore the SME was explicitly shown to be 
renormalizable at one loop |331l36j where latest computations have demonstrated renormalizability 
of the modified quantum electrodynamics m and the pure Yang-Mills sector [38] at infinite- 
loop order using algebraic techniques. Therefore as long as the SME is restricted to Minkowski 
spacetime, it seems to be a reasonable, well-behaved, and model-independent test framework for 
Planck-scale physics. 

The gravitational sector of the SME was constructed in the seminal article |39|. In the aftermath, 
studies on its theory and phenomenology were performed in a successive series of papers [4nH49| 
with recent investigations of even nonminimal operators in short-range gravity tests [501 ED- One 
of the most important theoretical results of [39] is a no-go theorem stating that explicit Lorentz 
violation is incompatible with the geometric framework of General Relativity, which is Riemannian 
geometry. Gonsidering Lorentz-violating matter in a gravitational background results in modified 
conservation laws of the energy-momentum tensor based on Noether’s theorem. However Lorentz 
violation does a priori not modify the geometrical base such as the Bianchi identities of the Riemann 
curvature tensor. Due to the Einstein equations the second Bianchi identity is tightly bound to 
the conservation of energy-momentum, which is then incommensurate with the modified matter 
sector. 

A possibility of circumventing this clash is to perform phenomenological studies in theories 
resting on spontaneous Lorentz violation. This means that a Lorentz-violating background held 
arises dynamically as the vacuum expectation value of a vector or tensor held. Such models have 
been studied since the early 1990s [3 (even before the SME existed) and they can be 

considered as one of the motivations that lead to the construction of the SME. The crucial point 
within models of spontaneous Lorentz violation is to take into account the Nambu-Goldstone modes 
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that are linked to the symmetry breaking. This can lead to arduous perturbative calculations within 
such a theory. 

For these reasons it would be preferable to have a setup available that allows for incorporating 
explicit Lorentz violation into a curved background without possible tensions with the underlying 
geometrical properties. A suggestion was already given in [3H] along the same lines as the no-go 
theorem: introducing an alternative geometrical framework that can include preferred directions 
naturally. Such an extension of Riemannian geometry has been known in the mathematics commu¬ 
nity for almost 100 years. It is named Finsler geometry in reference to the famous mathematician 
Finsler who studied generalized path length functionals in his Ph.D. thesis [571 [58] (cf. |59j for a 
comprehensive mathematical overview on the subject). 

Finsler geometry has been applied to various fields of physics |60j . In the context of the 
Standard-Model Extension it found its use just a couple of years ago when it was shown that 
the minimal Lorentz-violating fermion sector can be mapped to classical-particle descriptions M- 
165] . The corresponding Lagrange functions are closely linked to Finsler structures, i.e., generalized 
path length functionals. Recently a nonminimal case was studied [HS] as well as classical-particle 
trajectories in electromagnetic fields and modified spin precession based on an isotropic set of 
minimal fermion coefficients |67] . In [68] a particular class of Finsler spaces known as bipartite is 
investigated closer from a physics point of view and [69| suggests classical-mechanics systems that 
are linked to three-dimensional versions of Finsler b space |62) . In a very recent paper [TO] b space is 
discussed from a mathematical point of view. Its indicatrix (surface of constant value of the Finsler 
structure) is a two-valued deformation from a sphere that is characterized by singularities with am¬ 
biguous derivatives. Considering the indicatrix as an algebraic variety, the Hironaka theorem says 
that such singularities can be removed [7l]. In [70j a coordinate transformation was found, which 
allows to remove the singular sets and to glue the remaining parts together appropriately. This 
results in a well-defined mathematical description of b space that can be used for future physical 
investigations. 

The goals of the current article are threefold. First, analogous classical equivalents for the 
minimal CPT-even photon sector of the SME shall be found. Second, with these equivalents at hand 
we intend to study phenomenological aspects of Lorentz-violating photons in weak gravitational 
fields. Last but not least we would like to understand various consequences of this approach 
on the base of Einsler geometry. The procedures to be developed will differ extensively from 
the SME fermion counterparts. The paper is organized as follows. In Sec.|TI]the Lorentz-violating 
framework, which all investigations are based on, is introduced. A brief review on Einsler geometry 
and Einsler structures in the SME fermion sector is given in Sec. EH followed by an explanation of 
the method to constructing Einsler structures in the photon sector. In that section we investigate 
different cases that are the most interesting ones from a physics point of view. In the geometric- 
optics approximation photons are described by the eikonal equation, which forms the cornerstone of 
Sec.HVj It is demonstrated how the Einsler structures obtained are linked to the eikonal equation 
for the different sectors analyzed in the previous section. Since the isotropic modification of the 
CPT-even sector can be considered to be the most important one, all forthcoming studies will be 
based on the latter. Section |V] is dedicated to investigating the isotropic eikonal equation in a 
weak gravitational background. We develop a phenomenological framework to study light bending 
at massive bodies within such a theory. In this context prospects are given on detecting isotropic 
Lorentz violation of photons propagating in a gravitational background. This is carried out for 
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two space-based missions: GAIA and LATOR. The final part of the paper is more theoretical. In 
Sec. IVIl the modified conservation law of the energy-momentum tensor is investigated, interpreting 
the results from the point of view of explicit versus spontaneous Lorentz violation. Last but not 
least, in Sec. IVIII we examine the properties of the isotropic spacetime that has been subject to the 
studies in Sec. |V]from a Finsler-geometric point of view. The most important findings in total are 
concluded on and discussed in Sec. IVIIIl Essential calculational details can be found in Appx. 
to[^ Throughout the article natural units with h = c = 1 are chosen unless otherwise stated. 


II. CONSTRUCTION OF CLASSICAL LAGRANGIANS 

The base of the current article is formed by the minimal SME photon sector whose action Sy is 
comprised of CPT-eyen modified Maxwell (mM) [H |72l [73] theory and CRT-odd Maxwell-Chern- 
Simons (MCS) theory [TI I72H74] : 

Sy — / d X [eTm]y[(^) T 'TmCs(®) T 'Tmass(3j)] , (2.1a) 

X'muix) = J (2.1b) 

J^MCsix) = '^{kAFTE^x^.uA\x)F^'^{x) , (2.1c) 

Fmassix) = myA^{x)A^{x). (2.1d) 

Here FA x) — (9^Aj/(x) — dyA^{x') is the electromagnetic field strength tensor that involves the 
[/(I) gauge field A^(x). The fields are defined on Minkowski spacetime with metric {rj^y) = 
diag(l, —1, —1, —1). The totally antisymmetric Levi-Civita symbol in four spacetime dimensions 
is denoted as with ^0123 _ controlling coefficients characteristic for the framework 

considered are comprised in the fourth-rank observer tensor and the observer vector 

(kAp)'^- Both have dimensionless components and they do not transform covariantly with respect 
to particle Lorentz transformations, which renders this theory explicitly Lorentz-violating. The 
field operator of modified Maxwell theory is of dimension four, whereas the operator of MCS 
theory has mass dimension three. Therefore MCS theory involves the Chern-Simons mass scale 
rrics for dimensional consistency. It is well-known that a photon mass term encoded in cTmass 
(with the photon mass niy) violates U (1) gauge invariance. It has been introduced here for certain 
purposes that will be explained below, but for most occasions niy will be set to zero. Anyhow 
in [26| it was demonstrated that certain birefringent cases of modified Maxwell theory require a 
nonvanishing photon mass (at least in intermediate calculations) to have a consistent Gupta-Bleuler 
quantization. Finally, a gauge fixing term will be omitted in the action, since all considerations 
will be carried out at the classical level. 


A. Classical Lagrangians and Finsler structures 


The major goal is to understand how Lorentz-violating photons can be described in the context 
of gravity. Since Einstein’s relativity is a classical theory, it is reasonable to obtain a classical 
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analogue of the quantum field theory based on the action of Eq. (2.1). With such an analogue 


at hand it should be possible to study how an explicitly Lorentz-violating theory of gravity conld 
be constructed consistently. As an introduction to the topic the mapping procedure of the SME 
fermion sector to a classical point-particle description |61) shall be reviewed. From a quantnm 
theoretical point of view a particle can be understood as a suitable superposition of free-held 
solutions with dispersion relation 


= 0, {Pf,) = 


( 2 . 2 ) 


such that its probability density is nonzero in a localized region and drops off to zero sufficiently 
fast outside. Here po is the particle energy, p its three-momentum, the fermion mass, and 
kx denotes a particular set of Lorentz-violating coefficients where x represents a Lorentz index 
structure. The physical propagation velocity of such a wave packet is the group velocity 


dpo 

5p 


(2.3) 


A classical, relativistic pointlike particle is assumed to propagate with four-velocity = 7 ( 1 , v) 
where v is the three-velocity. To map the wave packet to such a classical particle, it makes sense 
to identify the group velocity components with the appropriate spatial four-velocity components: 


Vgr = - 


u 

T) 


u 


(2.4) 


The minus sign has its origin in the different position of the spatial index on both sides of the 
equation. Since the physics of the classical particle rests on a Lagrange function L = L(uP,u), 
its construction is of paramount importance. If the Lagrange function is positive homogeneous of 
degree one, i.e., L{XvP, Au) = XL[u^, u) for A > 0, the action is parameterization-invariant. In this 
case the physics does not depend on the way how the particle trajectory is parameterized, which 
is a very reasonable property to have. Positive homogeneity gives the following condition on the 
Lagrange density according to Enler’s theorem [59]: 


L = -p^u^^ , Pfj. = - 


dL 




(2.5) 


with the conjugate momentum p^. The latter is identified with the momentum that appears in 
the quantum theoretical dispersion relation of Eq. (2.2). The global minus sign in its definition 


has been introduced such that the nonrelativistic kinetic energy is positive. Now Eqs. (2.2), (2.4), 


and (2.5) comprise a set of five conditions that shall be used to determine and L. Hence all 
four-momentum components and the Lagrange function are supposed to be solely expressed in 
terms of four-velocity components. 

The Lagrange functions corresponding to the standard fermion dispersion law Pq — 'P^ — rn^ = 0 
read L = -s/ (u^)^ — u^. The two signs are the classical counterparts of the particle-antiparticle 

solutions at the level of quantum field theory. It can be checked that the five equations above are 
fulfilled for this choice of L. The latter can also be written in the form L = with 

known as the intrinsic metric. This metric is essential to determine lengths of vectors and 
angles enclosed by vectors. In the particular case considered it corresponds to the (indefinite) 
Minkowski metric: = rj^y. This is not surprising, since the starting point to obtaining the 
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Lagrange function was a field theory defined in Minkowski spacetime. By a Wick rotation the 
Lagrange function is related to a new function F based on a positive definite intrinsic metric: 

F{y) = ^(y> /) = —L{iy^, y) = \lnjV^y^ , = diag(l, l, l, . (2.6) 

Promoting to an arbitrary position-dependent metric rij(x), the function F becomes dependent 
on x: F{y) i—)■ F(x,y). It can then be interpreted as the integrand of a path length functional of a 
Riemannian manifold M where y G T^M. A Finsler structure is a generalization of that obeying 
the following properties: 


1) F{x,y) > 0, 

2) F{x, y) G for all y G T^M \ {slits}, 

3) positive homogeneity in y, i.e., F{x,\y) = XF{x,y) for A > 0, and 

4) the derived metric (Finsler metric) 

_ 1 92^2 
^ 2 dy^dy^ ’ 
is positive definite. 


(2.7) 


Prominent examples for Finsler structures that are outside the scope of Riemannian geometry are 
Randers structures, F{y) = a + (3, and Kropina structures, F{y) = jf3, with a = ^/aijy^ and 
/3 = biy^ where aij is a Riemannian metric and a one-form. There are certain theorems available 
to classify Finsler structures using various kinds of torsions. The most important one is the Cartan 
torsion Cijk, which is given by [75] 

C-- - ^ ^ 

* 2 ^ 2 dy^ Ady'-dy^dy^ 

In some books Cijk is defined with an additional prefactor F (see, e.g., [59]). The mean Cartan 
torsion reads as follows: 


l = hy\ Ii = g^’^Cijk, {g^n^{9^j) 


-1 


(2.9) 


with the inverse derived metric g^F Deicke’s theorem says that a Finsler space is Riemannian if 
and only if I vanishes m- The Matsumoto torsion provides a further set of quantities that are 
very useful to classify Finsler structures: 

1 .. , . . ... . ^ d'^F 


A^ijk — Ciik 


ijk 


n -|- 1 


(/j/ij'fc -|- Ijhik T Ikhijj 


^*2 ^~dy^dy3 


( 2 . 10 ) 


Here n is the dimension of the Finsler structure considered m- According to the Matsumoto-Hojo 
theorem a Finsler structure is either of Randers or Kropina type if and only if the Matsumoto 
torsion is equal to zero [77]. These theorems will be used frequently throughout the paper to 
classify Finsler structures encountered. 

According to the rules recalled at the beginning of the current section classical Lagrange func¬ 
tions of the SME fermion sector were derived in [6T] [64II67] . In the articles [621 - 1631 166] their 
corresponding Finsler structures were examined. In this paper analogous investigations shall be 
performed for the minimal SME photon sector based on the action of Eq. (2.1). It will become 


evident that the possible techniques used differ from the procedures adopted for the fermion sector. 
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B. Maxwell-Chern-Simons theory 


In the current section the CPT-even photon sector components in Eq. (2.1b) will be set 


to zero restricting our considerations to the MCS term of Eq. (2.1c) only. Furthermore the photon 


mass niy will be set to zero as well. In the seminal article m the magnitude of rncsikApY 
constrained tightly due to the absence of astrophysical birefringence. A collection of all constraints 
on components of rricsikAFY can be found in the data tables |5]. In spite of the tight bounds, 
MCS theory is very interesting from a theoretical point of view. The structure of the quantum field 
theory based on MCS theory is quite involved, which was shown by extensive investigations carried 
out in [20] 0 The smoking-gun results of the latter reference are that MCS theory is well-behaved 
as long as the preferred spacetime direction (kAp)^ is spacelike. For timelike {kAp)'^ issues with 
either microcausality or unitarity arise, though. Interestingly this behavior mirrors in the classical 
Finsler structure of MCS theory that will be derived as follows. First of all spacelike MCS theory 
shall be considered. The modified field equations in momentum space read as follows [T]: 


M“^(p)A5 = 0, 

= rf^k^ — — 2imcs{kAp) ky , 


(2.11a) 

(2.11b) 


where k^j, is the four-momentum to be distinguished from the four-momentum used for fermions. 
Imposing Lorenz gauge k^As = 0, the condition of a vanishing determinant of M results in 


+ 4mcs [k^{kApf - (k ■ kApf] = 0 , 
leading to the following dispersion relations: 


wi,2 = Y^k2 2mlg(kAp)^ ± 2mcs\/m|s(lcAF)^ + (k • ^ap)^ ■ 


( 2 . 12 ) 


(2.13) 


Here the spatial momentum k is not to be confused with the spatial part k^j? of the MCS vector. 
Following the procedure outlined in App. |A 2 leads to the Lagrange function 


i± 


-^Imcs 


±mcs -{kApYu^ ± \/{kAP ■ uY - {kApYv?^ . 


(2.14) 


First of all, this result matches the Lagrange function first obtained in m- For spacelike kAP 
it corresponds to the Lagrange density of the minimal fermionic coefficient where here (kAp)^ 
takes the role of and the Chern-Simons mass rrics takes the role of the fermion mass m^. This is 
because there exists a correspondence between MCS theory and the fermion theory involving the b^ 
coefficient whose Lagrangian has the form The associated field operator is of dimension 

three and it is CPT-odd [2], which parallels some of the properties of MCS theory. Therefore the 


Wick-rotated version of Eq. (2.14) can be interpreted as a b space. The form of the Lagrangian 


of Eq. (2.14) remains the same even for MCS theory with a timelike kAP, which can be shown by 


direct computation. Undoubtedly, issues arise for timelike kAP, since in this case the Lagrange 
function is not a real function any more. 


A classical Lagrange function is of mass dimension one, which is why Eq. (2.14) is directly 


proportional to the single mass scale rrics that appears in this framework. In the limit rrics t 0 the 


^ Note that the global prefactor of MCS-theory is different in the latter reference. 
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Lagrange function vanishes, which reveals the challenge in deriving appropriate Lagrange functions 
corresponding to Lorentz-violating frameworks that do not have a dimensional scale associated to 
them. This is especially the case for a photon theory based on modified Maxwell theory, which will 
be discussed as follows. 


C. Modified Maxwell theory 


In the remainder of the paper the Chern-Simons mass rrics will be set to zero and the Lagrange 


density of MCS theory, Eq. (2.1c), will not be taken into account any more. The observer four- 


tensor in Eq. (2.1b) will be decomposed into contributions involving the Minkowski metric 

and a (4 X 4) matrix according to the nonbirefringent Ansatz |73l [79] 




(2.15) 


The matrix is supposed to be symmetric and traceless. Its particular choice amounts to different 
Lorentz-violating cases in the minimal, CPT-even photon sector characterized by nonbirefringent 
photon dispersion laws at first order in the Lorentz-violating coefficients. This means that resulting 
dispersion relations for the two physical photon polarization states coincide with each other at first 
order in Lorentz violation. The notation — especially for the controlling coefficients — is mainly 
based on [72] . 

First of all the photon mass is kept. The equations of motion for the photon field in 
momentum space then take the following form mm- 


M^\k)As = 0 , 


(2.16a) 


M^Yk) = ri^\k^ - m^) - k^k^ - 2{kFT^^^kpk^ . (2.16b) 

Now different interesting cases of modified Maxwell theory (including a photon mass term) will be 
examined. The simplest case is undoubtedly the isotropic one, which is characterized by a single 
controlling coefficient Ktr and one preferred timelike spacetime direction The matrix is 
then diagonal and it is given as follows: 

= 2Ktr (er - = ^^trdiag (l, 0''" , (2.17a) 


(e) = (1,0, 0,0)^. (2.17b) 

The dispersion equation, which follows from claiming a vanishing determinant of in Eq. (2.16) 
using Lorenz gauge k^A^ = 0, results in 


ruy = a^^'kf.ki^ , 

— diag (1 -|- Ktr, [1 ^tr]) [1 ^trj) [1 ^tr]) 


flV 


(2.18a) 

(2.18b) 


The next case to be considered is a nonbirefringent, anisotropic one that is characterized by a single 
(parity-even) controlling coefficient Kgl and one spacelike direction Furthermore k^L = 
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= —2kII_ and all remaining ones vanish. The matrix for the nonbirefringent Ansatz is 
given as follows: 


1 


( eC - diag(l, -1, -1, 3) 


\fiu 


(2.19a) 


(C^) = (0,0,0,!)' 


(2.19b) 


The latter has a similar structure compared to Eq. (2.17a) and it is again diagonal. However its 
spatial coefficients differ from each other revealing the anisotropy. The modified photon dispersion 
equation can be written in the same form as for the isotropic case: 


ruy = , 


= diag ( 1 + 




1 + -K 


11 


1 — -K 


11 


flU 


(2.20a) 

(2.20b) 


The third particular case of modified Maxwell theory to be examined in this context is characterized 
by three (parity-odd) controlling coefficients and where all remaining ones that are 

not related by symmetries vanish. Furthermore there are two preferred spacetime directions: a 
timelike direction and a spacelike one The matrix in the nonbirefrigent Ansatz can be 
cast into 




(^^) = (1,0,0,0)'', (C^) =-2(0,0'', C = {kll,kli,kllf 


(2.21a) 

(2.21b) 


Due to observer Lorentz invariance the coordinate system can be set up such that ^ points along 
its third axis. The first photon dispersion equation is quadratic and reads as follows: 


ruy = c^^^'k^ku , 


/ 1 0 0 -g\ 

0-100 
0 0-10 
\-g 0 0 -ly 


& = \l{kf,? + {kf,Y + {k^lY. 


(2.22a) 


(2.22b) 


Note that the latter has an equivalent structure to Eqs. (2.18a), (2.20a). However the second 
dispersion equation is quartic and it is given by 


1 


0 = (fc^ - - {k ■ C){k ■ 0(fc^ - "ly) + T {(fc • C)^ + • 0^ - k'^]} • 


(2.23) 


For niy = 0 the right-hand side of the latter factorizes into and a quadratic dispersion relation 
that differs from Eq. (2.22) (for ruy = 0) at second order in the controlling coefficients. The 
nonbirefringent Ansatz of Eq. (2.15) prevents birefringence to occur only at leading order in Lorentz 
violation. 

Now the classical Lagrange functions for all cases previously introduced are given as follows. 
The derivation for one particular of those is shown in App. A 1 and it works analogously for the 
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remaining ones. For the isotropic case (denoted as ®) the Lagrange functions read as 

1 1 1 1 


{af,u) = diag 




1 + Ktr 1 - «^tr 1 - Ktr 1 “ ^tr 

For the nonbirefringent, anisotropic case (®) they are given by 

1 1 1 


T 1^ 


(biiu) = diag 


l + (3/2)Kii’ l + (3/2)^il’ 1 + (3/2)kIV 1 - (3/2)«ii 


= {b^-^ , 

Finally for the first dispersion relation of the parity-odd case 
L|| = ±my^yc^uUf^u’' , 

/ 1 /( 1 + £ 2 ) 0 0 -6/{l + e^)\ 

0 -10 0 
0 0-1 0 
y—£/(l-|-£^) 0 0 —l/(l-|-£^)y 


we obtain 


{CfMu) — 




(2.24a) 

(2.24b) 

(2.25a) 

(2.25b) 

(2.26a) 

(2.26b) 


Finding a classical Lagrangian that corresponds to the quartic dispersion equation of Eq. (2.23) is 
a challenging task that we leave for the future. The examples for Lagrange functions of Lorentz- 
violating photons in Eq. (2.24a), Eq. (2.25a), and Eq. (2.26a) reveal the general behavior. When 
the photon dispersion equation is of the form = rriy with an invertible (4 x 4) matrix Q 

the associated Lagrange function generically reads as (see [6T] for the fermion analogue): 

(2.27) 

These Lagrange functions rely on the existence of a nonzero photon mass. In general, Lagrange 
functions are of mass dimension one, which is why they have to involve some dimensionful scale 
characteristic for the physical problem considered. For the classical fermionic point-particle ana¬ 
logues studied in m this scale corresponds to the particle mass. In MCS theory the Chern-Simons 
mass rrics takes the role of the characteristic dimensionful scale as we saw in Eq. (2.14). However 
since modified Maxwell theory does not involve a dimensionful scale, a photon mass niy had to be 
introduced to construct Lagrange functions for the classical point-particle analogues. 


D. Classical wavefront 

A photon mass is undoubtedly not an attractive feature in a theory, since the mass term violates 
gauge invariance. Even if a photon mass has to be introduced as an intermediate ingredient to 
regularize infrared divergences in quantum corrections or to grant a consistent quantization of a 
particular Lorentz-violating framework, cf. [26], it should be possible to consider the limit my i—)• 0 
at the end of any calculation. Eor this reason an alternative procedure shall be developed to obtain 
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the classical analogue of (Lorentz-violating) photons. Classically, an electromagnetic pulse makes 
up a wavefront that can be interpreted as a surface in four-dimensional spacetime: w = w{t, x) = 0. 
In a Lorentz-invariant theory it fulfills the following equation [80] : 


dw\ _ _ ,9 

-{V») =0, 


Computing the square root and choosing one particular sign results in: 


(2.28) 


dw 


' ( dw 


dw 




f dw 


(2.29) 


The latter is a Hamilton-Jacobi equation where w is understood as the action S and the expression 
on the right-hand side as the Hamilton function: 


95 

iH 


+ H{x,'VS) = 0, S{t,x.) = w{t,x.), H{x,'k) = , 


where k is the wave vector (momentum). Examples that obey Eq. (2.29) are 
w = t — a ■ X, |a| = l, 


(2.30) 


(2.31a) 


w = t — vx 


(2.31b) 


The first describes a plane wavefront with unit normal vector a and the second a spherical wave- 
front. This can be seen by equating w with zero and considering a fixed value for t. Introducing A 
as a parameter for the trajectory of the wave, both wavefronts can be differentiated with respect 
to A, which leads to 


dw 0 - 


dw 


= u — 




dt 


u = 




(2.32a) 


(2.32b) 


At a first glance it may be assumed that the latter are suitable Lagrange functions, since they are 
positively homogeneous of degree one. However computing the derived metrics g^u according to 


1 d‘^L‘^ 

2 duf^du'^ ’ 


dfii' — 


(2.33) 


quickly reveals that their resulting determinants vanish. Therefore such a is not invertible and 
definitely fails to describe a possible (pseudo)-Einsler structure. This is a result that can be shown 
to hold in general. Assume that a Lagrange function L exists describing the classical wave-front 
analogue of photons. Then the associated conjugated momentum must be lightlike to obey the 
photon dispersion relation: 




(2.34) 
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Due to rotational symmetry in the Lorentz-invariant case it is sufficient to consider a (1+1)- 
dimensional spacetime, which is why a lightlike must be of the form stated in Eq. 

C°° function f{u^,u) where u= |u|. The derived metric is then given by 

_ 1 ^2+2 _ ^ Q-iL dL dL 

2 du^^du^ duf^du’^ duf^ du'^ 

''(±/< i ) (±/2 f ) 

where (1) denotes differentiation with respect to and (2) means differentiation by u. It clearly 
holds that <lei{g^u) = 0 irrespective of the unknown function /. Therefore a Lagrange function L 
with an invertible derived metric cannot exist in the photon case. Because of this an alternative 
procedure has to be developed to assign a possible (pseudo)-Finsler structure to photons, which 
will be examined in what follows. 


( Lfm + p L/®±/2 \ 
1±(L/M + /2) ±(L/®±/2)l ' 

\ / ai/ 


2.34) with a 


III. FINSLER STRUCTURES OF THE PHOTON SECTOR 

In the previous section it was motivated that the usual method to finding Finsler structures in 
the fermion sector does not seem to work in the minimal CPT-even photon sector. The reason 
is the absence of a dimensionful physical scale needed for dimensional consistency of a Lagrange 
function. Photons must be treated differently from fermions to obtain something like a classical 
description. This shall be undertaken in the current section. 


A. Lorentz-invariant case 


To become familiar with our goals, the situation in standard electrodynamics will be described 
first. In a Lorentz-invariant vacuum Maxwell’s equations in momentum space read as follows: 


k X B + wE = 0 , k X E — wB = 0 . 


k-E = 0, k-B = 0. 


(3.1a) 

(3.1b) 


Here E is the electric field, B the magnetic flux density, k the wave vector, and w the frequency. 
The dispersion relation can be derived directly from the wave equation. The latter is obtained by 


computing the cross product of the wave vector and, e.g., the first of Eq. (3.1a) where the second 
equation has to be plugged in subsequently: 


k X (k X B) + cj k X E = k(k • B) - k^B + B = (w^ - k2)B = 0 . 


(3.2) 


Here the second of Eq. (3.1b) is used as well, which says that in a Lorentz-invariant vacuum the 


magnetic field is transverse. Equation (3.2) has nontrivial solutions for the magnetic field only in 
case of a;2 = k^, which immediately leads to the dispersion relation w = |k| of electromagnetic 
waves. The dispersion equation 


^2 _ k2 = 0 


(3.3) 
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is the base to determine the Finsler structure associated to standard Maxwell theory. The method 
is introduced in [60] and will be described as follows. Let M be a Finsler manifold and F = F{x, y) 
the corresponding Finsler structure with x € M and y G T^M where TxM is the tangent space at 
X. The indicatrix SxM at a point x of a Finsler space is the set of all y where the Finsler structure 
takes the constant value 1, i.e., SxM = {?/ G TxM\F{x,y) = 1}. Note that a Finsler structure 
defines an indicatrix, but conversely each indicatrix determines a Finsler structure |81) . 

Finsler himself expressed the idea that an indicatrix might model the phase velocity of light 
waves in both isotropic and anisotropic materials. Hence what is needed to associate a Finsler 
structure to a photon theory is an indicatrix [60] . The phase velocity vector is defined as Vph = kuph 
with Uph = w/|k| and the unit wave vector is k = k/|k|. Since Eq. (3.3) still depends on both 
the energy and the momentum components, we divide it by |kp. This results in an equation that 
involves the phase velocity and quantities of zero mass dimension: 




-1 = 0 . 


(3.4) 


Now Eq. (3.4) can be considered as the indicatrix of the associated Finsler structure that it still to 

Consider a 


be found. This is accomplished using Okubo’s technique, which is outlined in 
surface within a Finsler manifold M that is described by an equation /(x, y) = 0. A function F{y) 
taking a constant value 1 on such a surface can be found by solving the equation /(x, y/F{y)) = 0 
with respect to F(y) where the solution does not necessarily have to be unique. Denoting the phase 
velocity by Uph = |u| with u = we perform the replacement u* i—)■ tt*/E(u) and obtain 


from Eq. (3.3) 


u 


-1 = 0 . 


F(u)2 

The latter can be solved for F’(u) immediately: 


(3.5) 




jU^u^ , rjj = diag(l, 1,1) 




(3.6) 


As long as the intrinsic metric rij is positive definite, which is the case for the particular rjj given, 
''' fulfills all properties of Sec. Ill A1 Therefore it can be interpreted as a three-dimensional 


Finsler structure where the derived metric 




corresponds to the intrinsic metric. Since the 


Cartan torsion vanishes, it must be a Riemannian structure according to Deicke’s theorem. 


B. Isotropic case 

In the Lorentz-violating case modified Maxwell’s equations can be constructed by using Eqs. (4) 
- (6) of [72]. A Lorentz-violating vacuum behaves like an effective medium for electromagnetic 
waves, which is why Maxwell’s equations now involve nontrivial permeability and permittivity 
tensors. In momentum space they read as follows (where the spatial indices of k are understood 
to be upper ones): 

k X H -|- (uD = 0 , k X E — wB = 0 , (3.7a) 


k-D = 0 


k-B = 0. 


(3.7b) 
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The first two of these deliver relationships between the electric displacement D, the magnetic field 
H, the electric field E, and the magnetic flux density B. The transformation between (D,H) 
and (E,B) is governed by (3 x 3) matrices Kbe, i^db-, Khe, and Khb comprising the controlling 
coefficients and they are given by Eq. (4) in the latter reference. In the isotropic case considered 
here the matrices k^b and k^e do not contribute. It then holds that 


H — /i ^B , /r ^ — I 3 + Kjjg — I 3 — Kgg ) 


D — eE, e — I 3 + Koe ; 
Kdb = Ktr diag(l, 1,1) = -Khb 


e/r = diag(l, 1,1) 


n ^ = Jl = 


1 - Htv 
1 + 


(3.8a) 

(3.8b) 

(3.8c) 

(3.8d) 


Maxwell’s equations in momentum space will be needed to obtain the dispersion relations. Each of 
the equations involves different fields. However to obtain the dispersion relation, a single equation 


is required that contains one of the four fields only. Since according to Eq. (3.8) the different 


fields are related by matrices proportional to the unit matrix, the standard procedure outlined in 
Sec. IIII Al works here: 


k X (k X E) — a;(k x B) = k x (k x E) — a;^(k x H) 

= k X (k X E) + = k X (k X E) + ca^e^E = 0 , 

Writing the equation explicitly in matrix form leads to 


(3.9) 


^ — (^2 + /cf) kik2 


\ 


kik2 

kiks 


n 


— {kf + fel) 

k2k3 


kiks 

k2k3 


n 


— (kf + k^) 



(3.10) 


where E = {E^, , E^)'^ is the electric field strength vector. Lowering the indices of the compo¬ 

nents of k does not lead to changes, since the components always appear in bilinear combinations. 
The condition of a vanishing determinant of the coefficient matrix, which is demanded for the 
existence of nontrivial solutions for the electric field, leads to the dispersion equation 


2\2 


0 = - k") 


(3.11) 


From this we obtain the spurious solution a; = 0 associated to a nonpropagating wave and the 
modified dispersion relation oj = ^|k|. Now we again need an indicatrix. A reasonable choice to 
start with is Eq. (3.11). Dividing the latter by the prefactor and computing the square root does 
not change the set of physical zeros for lb, i.e., we can also take 

nV-k2 = 0. (3.12) 

A subsequent division by |kp results in the indicatrix of the related Finsler structure: 


(3.13) 
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Using Okubo’s technique we obtain -F(u) immediately: 

,2 


0 = 


u 


F(u) 




-^(u)l® = ±^\/u 2 = ±}-^rijV}W , Tij = diag(l, 1,1)^ , 


(3.14a) 

(3.14b) 


where the symbol ® denotes “isotropic.” Comparing Eq. (3.14b) to Eq. (3.6) we see that the only 
difference in comparison to the Lorentz-invariant case is the prefactor Xj Ji. This is not surprising, 
as the case considered is isotropic and the result involves the spatial velocity components only. 
Eor a positive definite rij, E(u)|@ fulfills all properties of a Einsler structure where the derived 
metric is given by 5 ® = rijj . Due to the isotropy the latter is still Riemannian, which can be 

explicitly checked via the Cartan torsion. In comparison to the Lorentz-invariant case it involves 
a global scaling factor. 


C. Anisotropic, nonbirefringent case 

The anisotropic case with a single modified dispersion relation reveals some peculiar properties. 
The matrices relating the different electromagnetic fields with each other are given by 


diag(l, 1 , — 1 ) — —Khb 1 


= Kf 


= 03, 


(3.15a) 


(3.15b) 


with the (3 x 3) zero matrix O 3 . The matrices Koe and Khb are diagonal as well, but the difference 
to the isotropic case is that they are no longer proportional to the identity matrix. This is not 
surprising due to the preferred spacelike direction ^ pointing along the third spatial axis where there 
is a residual isotropy in the plane perpendicular to this axis. Therefore the first two components 
of the diagonal matrix e/r are equal, but the third differs from those: 


en = diag(nf,n2,ni), 


ni = n2 = ^, ^3 = ® , 


® = 


'l-(3/2)Kii 


11 

e- 

’'e- 


(3.16) 


^ l + (3/2)Kii 

Now we again need an equation that can serve as a basis for the indicatrix of the associated Einsler 
space. Multiplying the second of Eq. (3.7a) with computing the cross product with k, and 


using the first of Eq. (3.7a) leads to an equation for the electric field vector: 


k X [(^"^(k X E)] + a;2eE = 0. 

Multiplying the latter with an appropriate prefactor, in matrix form it reads as follows: 
fuj'^ — ko — k'^n^ kik2 kik^n\ \ (E^\ /o^ 


V 


kik2 

kik^n^ 


UJ‘ 


— ki — k’^n'l k2k^n\ 
k2k^n\ (w^ — ki — ki) 



(3.17) 


(3.18) 


Lowering the components of k does not produce any changes. The determinant condition for this 
system of equations leads to 

0 = — , (3.19a) 


fc|l = k-C, A:^ = |k-fc||C|. 


(3.19b) 
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For convenience the three-momentum k is decomposed into a component along the preferred 
spatial direction ^ = (0,0,1)^ and into a component k_i_ perpendicular to C- This results in the 
spurious solution cj = 0 and a single dispersion relation for electromagnetic waves: 


w = ,Jk\ + <B‘^kl 


(3.20) 


Here the remaining isotropy perpendicular to the preferred direction becomes evident as well. 
The photon will only be affected by Lorentz violation in case it has a momentum component 
pointing along the preferred direction. Note that the result of Eq. (3.20) is very interesting from 
the perspective that the underlying Lorentz-violating framework is anisotropic, but in spite of this 
anisotropy there is only a single dispersion relation. In contrast, birefringence, i.e., the property 
of having two different dispersion laws dependent on photon polarization seems to always occur in 
anisotropic media in nature. The reason that there is a single dispersion relation here only is the 
extreme fine tuning of permeability and permittivity (cf. Eq. (3.15a)), which can most probably 
not be found in any materials. 

Now, the equation for the indicatrix follows from 


— k\ — = 0 . 


(3.21) 


in dividing it by k^. Introducing the angle 'd between the wave vector k and the preferred spatial 
direction ^ leads to 


0 = Uph - sin^ d - cos^ d , 


cos = k • ^ , k = — . 

k 


(3.22a) 

(3.22b) 


Thinking of d as the polar angle in spherical coordinates, Eq. (3.22a) can be reinterpreted using 

+ (u 2)2 


= u 


COST? = 


U 


sin-d = 


(3.23) 


|u| |u| 

as follows: 

- [(u^)2 + {u^f + = 0 . (3.24) 

The latter is the equation that determines the indicatrix. Okubo’s technique can again be used to 


obtain a Einsler structure directly when u is replaced by u/F{u) in Eq. (3.24): 


u 


-Fiuf [(nl)2 + (u2)2+®2(^3)2] 


This leads to the result 
T(u)|± = ±- 


u 


y(^l )2 + (y2)2+®2(^3)2 
which can also be written in the form 

/L/f 


(3.25) 


(3.26) 




^SijU^v) ’ 


rij = diag(l, 1,1)*^ , Sij = diag(l, 1, ®^) 




(3.27) 
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Here © means “anisotropic.” In principle the Finsler structure can be interpreted to involve an 
intrinsic metric rjj and a second metric Sij. Since the background considered is flat, it is reasonable 
to take Vij as the metric that determines the lengths of vectors and the angles between vectors. 
For general rij and Sij the derived metric is given by 


9ij = ^(u)| 


± d‘^F{yi)\^ dF{u)\ ^ dF{M)\^ 

® du^du^ du^ 


du^ 


9F(u)| 




= ± 


1 




{SabU°‘u’^Y/'^ 

3s 




(u)|| _ _ ^ _ uVu'-u" 

du^diiJ 


1 


{SabU°'u’^)^/‘^ 

Qiklm ~ im • 


Qiklm 


L.m ^ ^ ^mj^k^l 


U U 


(3.28a) 

(3.28b) 


(3.28c) 

(3.28d) 


This result is not very illuminating. When contracted with appropriate velocity components it 
collapses to (T(u)|q)^, which follows from its homogeneity of degree 2: 




152(T(u)|±^2 


iqv 
du^du^ 


Now the following properties of F(u)|q can be deduced: 


(3.29) 


1) F(u)|q > 0 if Vij is positive definite, 

2) F(u)|q G C°° for u G TM \ {0} as well as positive definite Sij, 

3) F(Au)|q = AF(u)|q for A > 0, i.e., positive homogeneity, 

4) and the derived metric gij is positive definite as long as Sij is positive definite. 


Therefore as long as both rij and Sij are positive definite, which in particular is the case 
for Tij and Sij given in Eq. (3.27), E(u)|q defines a three-dimensional Finsler structure, indeed. 
Furthermore both the Cartan and the Matsumoto torsion can be computed to be able to classify 
this Finsler structure. The results are complicated and they do not provide further insight, which 
is why they will be omitted. However they are nonzero in general whereby according to Deicke’s 
theorem, Eq. (3.27) is not a Riemannian structure and according to the Matsumoto-Hbjb theorem 
it is neither a Randers nor a Kropina structure. The result corresponds to Eq. (4.2.2.6) of [60] where 
a = ® and 6 = 1 in their notation. They denote this type of Einsler structure as a second-order 
Kropina structure in resemblance to a Kropina structure F{u) =0^1/3 with a = y/oijvfu^ and 
j3 = biU^. In the latter reference Eq. (3.27) appears in the context of light propagation in uniaxial 
media. The numerator involves the Euclidean intrinsic metric rjj only, whereas the denominator 
is characterized by another metric Sij. The latter could be thought of as the metric governing 
physics, since it involves the physical quantity ®. 
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D. Anisotropic, birefringent (at second order) case 


The penultimate example provides a case of modified Maxwell theory that has not been con¬ 
sidered in Sec. inn It is parity-even and characterized by two preferred spacelike directions: 


(cr) = 


(C2^) = 


Ci = 


smr/ 


/ 

0 

\cosr]j 


C 2 — 



(3.30) 


They are normalized and enclose an angle of 2r]. We consider an observer frame with one nonzero 
controlling coefficient Q. Then the (4 x 4) matrix employed in the nonbirefringent Ansatz reads 

= g (crC2" + CK^ - 2 (Cl • C2)r?^") . (3.31) 

This corresponds to the following choices for the matrices that appear in Maxwell’s equations; 

Ude = Q diag(l, cos(2ry), -1) = -Khb , (3.32a) 

Kob — i^HE — O3 • (3.32b) 

Hence there are nontrivial permeability and permittivity tensors, but the electric and magnetic 
fields do still not mix. Using these matrices, modihed Maxwell’s equations can be obtained accord¬ 
ing to the procedure used in Sec. IIII Cl The condition of a vanishing coefficient determinant for 
nontrivial solutions results in an equation for the dispersion relation: 

0 = a;2 [(1 - ^ -Qcosi2ri)]kj - (1 - Q^)kl - (1 - g)[l - gcos(277)]A:i] 

X {w^[l + gcos(2r/)]-(l-hg)A;f-[l-Fgcos(2??)]A:|-(l-g)A;|} . (3.33) 


In contrast to the anisotropic case considered in Sec. IIII Cl the current framework is characterized 
by two distinct modified dispersion relations. They can be written in the form 


Qi = 


Qi 


Qlkf + k2 + g2^3 ) 


(3.34a) 

\jQik\ + k2 + Q2k^ , 


(3.34b) 

1 — Qcos{2r]) 


1 — g cos{2ri) 

(3.34c) 

1 -g ’ 

1 + g ’ 

i + g 


i-g 

(3.34d) 

l + Q cos(2r/) ’ 

1 -h gcos(2ry) ' 


Evidently the contribution associated to the second three-momentum component stays unmodified 


which is reasonable, since the preferred directions of Eq. (3.30) do not point along the second 


spatial axis. Each dispersion relation can be expanded for g <C 1 showing that they differ at 


second order in Lorentz violation. In general the nonbirefringent Ansatz of Eq. (2.15) works at 


leading order only. Besides, the dispersion relations depend on the angle 2r/ enclosed by the 
two preferred directions. With the normalized propagation direction of the electromagnetic wave 
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given by k, the latter encloses the angles 6 i, 62 with the first and the second preferred direction, 
respectively. These are given by: 


cos 61 = li. ■ Cl = sin rj + cos i] , 
cos 62 = ^ ■ C 2 — ~k^ sin 7 ] + cos rj. 


(3.35a) 

(3.35b) 


The components of the propagation direction vector k can now be expressed in terms of the angles 
61 , 62 , and r/. Note that k is a unit vector by construction: 


yi cos 9i — cos 02 

k = - 

2sinr/ 


yo cos 0 i + cos 02 ^2 

k = 


2 cos rj 


k^ = yl — {k^Y — {k^Y ■ 


(3.36) 


Now the two individual factors of Eq. (3.33) are considered giving the modified dispersion relations. 
Dividing each by the wave-vector magnitude |k|, introducing the phase velocity, and expressing all 
propagation direction components by the angles of Eq. (3.36), equations for the phase velocities 
are obtained as before: 


0 = (1 - Q )vL + — {4cos(6»i) cos( 6'2) - Q [cos(2(9i) -b cos(26'2)]} - 1, 


0 = [1 + g cos( 27/)] (1 - Uph) - 2Q cos(6»i) cos(6»2) 


(3.37a) 


(3.37b) 


In dividing the second equation by — [1 + Qcos{2r])] and expanding both equations to linear order 
in Q these results correspond to each other as expected. Now we are in a position to interpret the 
latter equations geometrically, which will lead us directly to the Finsler structures associated to 
this particular sector. In doing so, the velocity u is introduced and both the phase velocity and 
the angles 0 i, 02 are expressed by the magnitude or components of u as follows: 


— 


u 


1 

. u 

cos 61 = —r Sin T] + ^—r cos T] , 

u u 


1 

„ u 

cos 62 = — 1—r sin ri + -—r cos ri. 

u u 


(3.38a) 

(3.38b) 

(3.38c) 


Inserting those into Eq. (3.37a) and using Okubo’s technique leads to two distinct Finsler struc¬ 
tures. The first is given by 

/1 \ I . . 'u^'uiP 

^(u)|® = ± 7 = diag(I, I, l)ij , Sij = diag(gi, I, Q 2 )ij , (3.39) 

y Si ju'-u^ 

and the second reads as 


F(u)|g)^ = ±- 




TijU U- 


nj = diag(I, I, l)ij , Sij = diag(gi, I, Q 2 )ij ■ 


(3.40) 


y Si jU^-y 

Here ® means “anisotropic and birefringent (at second order).” The four Finsler structures ob¬ 
tained have a form analogous to the Finsler structure found in Eq. (3.27) of Sec. IIII Cl This is 
not surprising, since both sectors are anisotropic but parity-even. Having birefringence at second 
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order in Lorentz violation does obviously not affect the form of the Finsler structure. In such a 
case we can obtain several distinct Finsler structures that differ from each other at second order 
in the controlling coefficients via the metrics Sij. 

In Eq. (3.27) the latter Sij differs from the standard Euclidean metric only by the component 
S33. Here both sn and S33 are modified by Lorentz violation where they also depend on the angle 
r] enclosed by the two preferred directions. The component S 22 is standard, which again reflects 
the fact that the preferred directions have a vanishing second component. Since Sij involves the 
physical (dimensionless) constants Qi and Qi for i = 1... 2, it is reasonable to say that Sij seems 
to govern the physical properties of photon propagation in these cases. 


E. Parity-odd case 


The final interesting sector considered involves the three parity-odd coefficients and 

and it will turn out to be the most complicated one. The preferred spacetime directions are 
given in Eq. (2.21b) and the matrices relating the electromagnetic fields to each other read 

= O 3 ) >^HB = O 3 ) (3.41a) 


( ° 

~12 

^ 0 + 





0 

SS 

_ T _ 

1 — ^DB — ^DB • 

(3.41b) 

V 

~23 

0 




The relationships between the fields are given by 


E) — E -|- , 


(3.42a) 


H = KifsE + B = KdsE + B . (3.42b) 

In contrast to the aforementioned cases the parity-odd case has the peculiarity that the electric fields 
mix with the magnetic fields. Therefore obtaining an equation for the electric field from Maxwell’s 
equations is more involved here. Nevertheless it can be accomplished along the following chain of 
steps: 


0 — X E) cjk 25 £;B , 

(3.43a) 

0 = KobO^ X E) — a;(D — E), 

(3.43b) 

0 = KDs(k X E) + k X H + uiB , 

(3.43c) 

0 = X E) -)- k X -)- B) caE , 

(3.43d) 

0 = KDs(k X E) + k X (kdsE) + —k x (k x E) + wE . 

(3.43e) 


OJ 


Inserting the explicit vectors and a subsequent multiplication with a; leads to the following system 
in matrix form: 



{A + B) 


(e^ 


(3.44a) 
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A = 


— (^2 + ^i) kik2 kik^ 

kik2 - {kf + /c|) k2k3 

\ kik^ k2k3 u? - (kf + kl)^ 


(3.44b) 


(3.44c) 


The total system can be completely decomposed into the standard part of A and a Lorentz-violating 
contribution comprised in B. Note that here B gets a global minus sign when lowering the indices 
of the k components. Therefore the determinant condition results in the following equation for the 
photon energy: 



(- 

2 (Kj^+fe2 + K^+fes) 

Ko+A:1 + Ko+^2 

+ Ko+fes 

B = to 


-h ^0+^2 

-2(k23_A:i -PK^^fes) 

Ko+/C2 + 


1 

Ko+fei -h ^23/C3 

Ko+^2 + Ko+^3 

-2(^23 fci + 153 )^/ 02 ) 


UJ 


■2(a;2 - 2a; C • k - k^) [(w - C • k)^ - (1 + C^)k2] = 0 . 


(3.45) 


Here C = (ko+^'^o+^^o+)'^ i® spatial part of the second preferred spacetime direction and k is 
understood to have lower components. The second and the third of the three factors can be solved 
for the energy giving two distinct dispersion relations: 


(ui = C-k+Vk2 + (C-k)2, 
a;2 = C • k + + C^|k| , 

COS 79 = C-k, C=|, ^ = 1^1 = V + («o+)^ • 


(3.46a) 

(3.46b) 

(3.46c) 


For convenience it is again reasonable to set up the coordinate system such that ^ points along its 
third axis where 'd is the angle between the wave vector k and the spatial direction. Dividing the 
first factor of Eq. (3.45) by k^ then leads to 


— 2guph cos — 1 = 0 . 

Introducing spherical polar coordinates with Uph = |u| results in 

- 2<gu^ -1 = 0. 


(3.47) 


(3.48) 


This is the indicatrix for the first Finsler space that can be associated to the parity-odd case. We 
can employ Okubo’s technique to obtain 


0 = - 2gF{u)u^ - F{uf 

F(u)|^^)^ = -gu^ ± v'u2 + (<g#)2 , 

= -C -uT v^u2-h(C-u)2. 


(3.49a) 

(3.49b) 

(3.49c) 


where Eq. (3.49c) is the generalization of Eq. (3.49b) for ^ pointing along an arbitrary direction 


and the symbol (8> denotes “parity-odd.” Without loss of generality the properties of the Finsler 
structure can be investigated with pointing along the third axis of the coordinate system, which 
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simplifies the calculations. The derived metric is again lengthy and does not seem to provide any 
deeper understanding. The derived metric contracted with the spatial velocity components leads 


to the square of Eq. (3.49c): 


(3.50) 


j , du^dui 

Therefore the following properties of T(u)|^^^ in Eq. (3.49c) can be deduced: 

1) > 0 for u E TM \ {0}, 

2) E for u E TM \ {0}, 

3) F(Au)|^^^ = AF(u)|^^^ for A > 0, and 

4) the derived metric of F(Au)|^^^ is positive definite for u E TM \ {0}. 

Due to the first item, only T(u)|^^'*~ is a Finsler structure. Its Matsumoto torsion vanishes, 
whereas the Cartan torsion does not. Furthermore when taking into account its form, T(u)|^^'*~ 
must be a Randers structure. This particular type of geometry was introduced by Randers to 
account for the fact that particles always move on timelike trajectories pointing forwards in time 
|82j . In contrast to General Relativity his framework incorporates an additional four-vector into 
the metric. However this four-vector should not be considered as a preferred spacetime direction, 
since it can be changed by a kind of gauge transformation without affecting the arc length travelled 
by a particle. In the Lorentz-violating case considered here ^ is a preferred direction, indeed. 

The parity-odd framework is characterized by both a preferred timelike and a spacelike direction. 


cf. Eq. (2.21b). For the isotropic and anisotropic cases, which are parity-even, the corresponding 


Finsler structures are expected to involve only bilinear expressions such as atju^u^, since these are 
invariant under u* i—)• n'* = — n*. Due to parity violation the Finsler structure of the parity-odd 
case is expected to involve terms such as bjU*, though. The Randers structure is a very natural 
possibility with this property, but it is not the only one as we shall see below. 


The Finsler structure of Eq. (3.49c) has the same form as the corresponding dispersion relation 


of Eq. (3.46a) not taking into account additional minus signs. Such structures could be called 


“automorphic.” They seem to appear when the dispersion equation (here Eq. (3.45)) involves one 
additional parity-odd contribution. 

The parity-odd case of modified Maxwell theory has a second indicatrix, which follows from the 


second factor of Eq. (3.45) using the same procedure: 


^ph 


— 2guph cos -d + 8'^ cos^ 7? — (1 -|- g^) = 0 , 






Okubo’s technique leads to 


F(u)|g>± = ^ 


u 


l + g2_ (git3)2/u2 


(3.51a) 


(3.51b) 


(3.52a) 


\|(2)C± _ -C • U ± VT+~P|u| 

“ l + g2_ (^.u)2/u2 • 


(3.52b) 
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Let us investigate the characteristics of Eq. (3.52a). We again obtain 


= (F(u)|g^^)2, 


,( 2 )± = _ 


l92(F(u)|g)^)2 


du^du^ 


(3.53) 


Hence for E(u)|^' analogue properties hold such as for Eq. (3.49c), which makes it to a Finsler 


structure. Note that the latter is not automorphic, since its off-shell dispersion relation in Eq. (3.45) 
does not exclusively involve additional parity-odd terms, but also contributions like • k)^. For 
this structure the Matsumoto torsion does not vanish, which is why it is neither a Randers nor a 
Kropina structure. The deviation from a Randers structure is of second order in the controlling 
coefficients: 


= -C • u ± vTT^|u| -k ©(ko+) . 


(3.54) 


Recall that the massive-photon dispersion equation of this mode, Eq. (2.23), was not quadratic. 


but quartic. For this reason it was challenging to derive a classical point-particle Lagrange function 
corresponding to the second photon polarization. It is also interesting to note that a large number 
of complications arise in the quantum field theory based on the parity-odd framework due to the 
behavior of this mode [24]. On the contrary the first mode is much easier to handle. The Finsler 


structures obtained seem to reflect these properties. The first, given by Eq. (3.49c), is a well 


understood Randers structure, whereas the second deviates from such a structure at second order 
in Lorentz violation, which makes its properties much more involved to analyze. 

The studies carried out in the current section will prove to be useful when describing photons 
in the geometric-optics approximation. Thereby the eikonal equation will play an important role. 
How all these concepts are linked to each other will be clarihed in the forthcoming part of the 
article. 


IV. CLASSICAL RAY EQUATIONS 

Propagating electromagnetic waves can be treated in the geometric-optics approximation as 
long as their wave lengths can be neglected in comparison to other physical length scales. For 
example this is possible for waves with low energies propagating over large distances when physical 
phenomena related to the wave character (such as diffraction) do not play a role. This physical 
regime could be called “classical” and the wave then corresponds to a geometric ray. The goal of 
the current section is to establish ray equations that describe the physical behavior of propagating 
rays. 

Each electromagnetic pulse has a wavefront, which separates the region with nonzero electro¬ 
magnetic fields from the region with vanishing fields. At any instant of time the wavefront can be 
considered as a two-dimensional surface in three-dimensional space, i.e., it can be described by an 
equation of the form 'ip{x) = t where x are spatial coordinates and t is the time. The gradient 
points along the propagation direction and it is perpendicular to the surface. There is a relation 
between and the refractive index n of the medium; it reads as |V'!/>| = n. The latter is called 
the eikonal equation in a subset of the literature. In what follows, n is assumed to depend on the 
position X only, but not on the velocity u, i.e., n = n(x). 

Consider a wave propagating along a trajectory x(s) where s is the arc length of the curve. In 
this parameterization the tangent vector has magnitude 1, which is why the ray equations read as 
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follows: 


dx VV’ dx 

dJ = iwI’ 


(4,1) 


Computing an additional derivative of the latter with respect to s, its right-hand side can be 
expressed in terms of the refractive index as well: 


Av^/i = ( A . V ) VV' = -Vip ■ [V(VV’)] = —ViVi/jf = —Vn^ = Vn, 
ds V ds / n 2n 2n 


(4.2) 


Trajectories may not necessarily be parameterized by arc length. For an arbitrary parameterization 
with parameter t we obtain 


d dt d 
ds ds dt 


ds\ ^ d 


dt 


dt 


AA 

|u| dt 


Now the ray equations (4.1) can be cast into the following final form: 


d / dx\ 


(4,3) 


(4.4a) 


dt 


(V?/)) = |u| Vn , VV’= n 


u 


u 


(4.4b) 


The literature seems to be discordant about which equation should actually be called the eikonal 


equation. Some sources call the hrst one of Eq. (4.4b) the eikonal equation, whereas others denote 


it as the vector magnitude of the second one. Note that the latter leads us back to | VV'I = n (cf. 
the beginning of this section). In the current paper whenever referring to the eikonal equation, we 


will be talking about the first one of Eq. (4.4b). For clarity, the vector magnitude of the second one 


will be called the wavefront equation. Equation (4.4b) can be understood as the Euler-Lagrange 


equations resulting from the condition that the following functional becomes stationary: 

L[x, u]= f dsn(x) = f dtV , E = E(x, u) = n(x)|u| . 

Ja Jta 


(4.5) 


The integrand of this functional is the infinitesimal optical path length and the functional itself 
gives the total optical path length travelled by a ray along its trajectory between two points A and 
B. Here Ta is the departure time of the ray at A and Tb the arrival time at B. The optical path 
length is defined to be the path length equivalent that light has to travel in vacuum to take the 
same time as for a given path in a medium with refractive index n(x). The quantity V could be 
interpreted as the corresponding “optical velocity.” The functional of Eq. (4.5) can be understood 
as the base of the Fermat principle, cf. 


A. Wavefront and eikonal equation in modified Maxwell theory 


The analogue of the wavefront equation in Eq. (4.4b) in the context of modified Maxwell theory 


was partially studied in [85]. The authors of the latter reference chose the coefficients contained 
in Koe and as nonvanishing where both the trace of these matrices and the matrices mixing 
electric and magnetic fields were assumed to be zero. The trace components can be restored without 
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any effort by just replacing their /3 b by Koe and their /3 _b by Khb- The wavefront equation then 
follows from the matrix Mg in their Eq. (38): 

= (1 - I VV’P) 5^^ + , (4.6) 

where is the totally antisymmetric Levi-Civita symbol in three dimensions with = 1. This 
matrix is multiplied with the time derivatives of the fields, which are singular on the wavefront. 
Therefore their Eq. (33) can only have nontrivial solutions if the determinant of Mg vanishes. 
This condition directly leads to the wavefront equation within the framework considered. Eor 
the isotropic case (cf. Sec. IIIIB|l . the anisotropic, nonbirefringent case (cf. Sec. IIII Cl) , and the 
anisotropic, birefringent sector (cf. Sec. IIIIPp we obtain 


l = (4.7a) 

l = (5V)^ + (5^^)^ + ®^(9^V’)^ (4.7b) 

l-Q^ = I VV’I^ + S {(5V)^[1 - cos(2r/)] - (9^V’)^[1 + cos(2r/)]} 

- {((9^V’)^ + [(^V)^ - cos( 27/)} , (4.7c) 

1 + Q’cos( 27/) = I VV’I^ + C cos(2r/) — (c^^V')^} . (4.7d) 


These are the analogues of the wavefront equation |V'0p = in modified Maxwell theory, hol¬ 
lowing the lines in connection to Eq. (2.30) classical Hamilton functions can be obtained as parts 
of the Hamilton-Jacobi equation describing a classical ray. Eor the sectors considered few lines 
above they read as follows: 


H\@ = , (4.8a) 

^1® = -\lkl + kl + (B^kl, (4.8b) 

\/l - = -{k^ + Q {kl[l - cos(2r/)] - A:|[l + cos(2r/)]} 

- 0^ [kl + {kl - kl) cos ( 277 )] | ' , 

= + + (4.8c) 

\/l gcos(2r/)i7|J^^^ = + Q[kl + kl cos{2r]) - kl] , 

H\f = -^lQikl + kl + Q2kl, (4.8d) 


with Qi, Q 2 of Eq. (3.34c) and Qi, Q 2 taken from Eq. (3.34d). These Hamilton functions are directly 
linked to the modified dispersion relations, cf. the paragraph below Eq. (3.11) for the isotropic 
case, Eq. (3.20) for the anisotropic (nonbirefringent) sector, and Eq. (3.34) for the anisotropic 
(birefringent) case. This nicely demonstrates that all computations are consistent with each other. 

The wavefront equations (4.7) are not suitable for our calculations, since they involve first 
derivatives of the wavefront that are unclear how to be treated. Having the eikonal equations 
involving the refractive indices and velocity components only would be of advantage. As a cross 
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check with the previously obtained results the refractive indices can be derived from Eq. (4.7). 
For the isotropic case, using the second of Eq. (4.4b) we obtain IVV'P = which by inserting 


into Eq. ( |4.7a ) directly leads to the isotropic result n|@ = 1/J7. In Eq. (4.7b) of the anisotropic 
(nonbirefringent) sector we can introduce 


sin^ 'd , = n cos , 


(4.9) 


leading to n|Q = l/\/ sin^ i? + cos^ i?. The latter depends on the angle between the propagation 
direction and the preferred direction For the anisotropic (birefringent) sector we insert 


= n 


cos 6*1 — cos 02 „3 , cos 6*1 + cos 02 


2 sinr/ 


= n- 


2 COST] 


, = 'Jn^ — , (4.10) 


both in Eq. (4.7d) and Eq. (4.7c) to obtain two refractive indices differing at second order in 


Lorentz violation: 


= 


i-g2 


1 + ^ {(^/ 2 ) [cos( 20 i) + cos( 202 )] — 2 cos 0i cos 6 * 2 } ’ 


(4.11a) 


1 ( 2 ) 

n h' = 


1 + Q’cos(277) 


1 + Q[cos{2ri) — 2 cos 0i cos 02 ] 


(4.11b) 


Based on these refractive indices the integrands of the action functional in Eq. (4.5) can be com¬ 


puted. The results are consistent with Eqs. (3.14b), (3.27): 

1 


E(u)|® = n|®|u| = ^vu2 = F(u)|+ , 


(4.12a) 


^(u)|® = n|®|u| = 
= F{u)\+, 


V(ul)2 + (u2)2 + (^3)2 _ (nl)2 + (^2)2 + (ii3)2 


\/sin^ 1 ? + ®2 cos 2 1 ? \/ (ul )2 -h (n 2)2 ® 2 (u 3)2 




(4.12b) 

(4.12c) 


where for the latter two Eq. (3.38) has to be employed. The refractive indices obtained from the 


wavefront equations correspond to the refractive indices computed directly from their definitions 


via the inverse phase velocity: n = = |k|/w. 


^1® — '^ph|@ ~ , ,1 ~ ^ ’ 


cui® 


1 |k| / k2 1 

y kl+^^kf v^sin^ 1 ? + ®2 cos 2 i9 ’ 


«l®^ = Khl®^) ^ = jy 

wik 


«l®^ = (^'phl®) ^ = 


W 2 I 


(4.13a) 

(4.13b) 

(4.13c) 


These previously performed studies do not reveal any inconsistencies. The essential conclusion is 
that it should be warranted to describe the isotropic, anisotropic (nonbirefringent), and anisotropic 
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(birefringent) sectors of modified Maxwell theory (in the geometric-optics approximation) with an 


adapted version of the eikonal equation, Eq. (4.4b). 


Last but not least the parity-odd sector of Sec. IIIIEI shall be elaborated on. The wavefront 
equations for the parity-odd case were not derived in [85] , since in the latter reference all controlling 
coefficients mixing electric and magnetic fields were set to zero. Adapting the procedure used allows 
to derive them nevertheless. The authors of [85] consider the values of the fields directly on the 
wavefront, e.g., for the electric field: Eo(x) = E(t, Iii what follows, all fields evaluated 

on the wavefront will be denoted by an additional “0” as an index. The spatial derivative on the 
wavefront is then given by: 


aEn 


5E 

dx^ 


E 


d'4} 


(4.14) 


dx^ dx^ dx^ 

Based on this procedure, from Maxwell’s equations four equations can be derived that involve field 
components on the wavefront and field derivatives only: 


V X Eq = — B -I- VV’ X E, V X Ho = D -I- V-i/i x H, 


(4.15a) 


V • Do = • D , V • Bo = V?/) • B , 


(4.15b) 


D = E-I-k^ibB , H = KobE-1-B. (4.15c) 

These must be combined to obtain an equation that involves the time derivatives of only a single 
field, e.g., the electric field and field values on the wavefront that may not necessarily include only 


a single field. This can be carried out via the following chain of steps: 

V X Eo = —H -I- KoflE -I- VV' X E , (4.16a) 

VV’ X (V X Eo) = —VV’ X H -I- V-i/i x k^sE -|- V'i/’ x (VV’ x E), (4.16b) 

V'i/’ X (V X Eo) = D — V X Ho -I- V?/: x k_dsE -|- VV’ x (VV' x E) , (4.16c) 

VV' X (V X Eo) = E -|- KosB — V X Ho -I- x KdsE -|- VV' x (VV' x E), (4.16d) 

VV' X (V X Eo) = E -I- Kos(VV' X E — V X Eo) — V x Hq 

-|- VV' X KosE -|- VV' X (VV' X E). (4.16e) 

The resulting equation then reads 

E -I- KdsV'V' X E -I- VV' X KobE -I- VV' X (VV' X E) = VV' x (V x Eo) -I- V x Ho 

+ Kdb^ X Eo . (4-17) 


The condition for a vanishing determinant of the matrix on the left-hand side for the existence of 
nontrivial solutions leads to the wavefront equation for the parity-odd case. For consistency we 
pull the index of VV' down: 


(1 - 2C • VV' -1VV'P) [1 - (1 + C')|VV'I' - 2(C • VV') + (C • vV')'] = o. 
Inserting the second of Eq. ( |4.4b ) in the first factor of Eq. (4.18) results in 
1 — 2^ • V-V — I VV'P = 1 — 2n(^ • u — = 0. 


(4.18) 


(4.19) 
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The latter can be solved with respect to the refractive index n to give 

= — C • u + Y^l + (C • u)2 = —g cos'd + \/l + cos^ , (4.20) 

where only the positive-sign solution delivers a physically meaningful refractive index. Hence the 
result obtained from the eikonal equation is consistent with Eq. (3.49c), which can be seen upon 
close inspection: 


= n||^^Vl = -c • u -b Vu2 (C • u)2 = F{u 


iC(i)+ 


The same procedure applied to the second factor of Eq. (4.18) leads to: 
1 — (1 + — 2n(C • u) + n^{C • u)^ = 0 . 


(4.21) 


(4.22) 


Therefore the refractive index reads 


n 


C(2) —C • U + Vl + —S COST? -b Vl + 


l + <g2-(C-Q)2 


1 -b sin^ "d 


which is consistent with Eq. (3.52b) 

V{u)\f^ 


■ n 


|<( 2 ) 


u 


u -b \/r 

1 +<g2 


-c- 


Vl + S^\u\ _ p. ^|C{2)+ 

(C-u)2/u2 • 


(4.23) 


(4.24) 


The refractive indices obtained from the wavefront equations for the isotropic and anisotropic cases, 


Eqs. (4.7a) - (4.7d), respectively, are consistent with the usual definition of the refractive index 


via the inverse phase velocity (cf. Eqs ( 

4.13a 

)-( 

4.13c 

). However this does not seem to be the case 

for the parity-odd sector. Inspecting Eqs. (3.46a 

), ( 

3.46b 

) and the latter results for the refractive 


indices of Eqs. (4.20), (4.23) reveals the inconsistency: 
|k| 


1 


Wl|(g) g cos I? + Vl + COs2 -d 


/ n 


C{i) 


(4.25a) 


|k| 

W2|(g) 


£cosi? + v^r^^ 


(4.25b) 


The definition of the refractive index via the inverse of the phase velocity rests on the existence 
of a nonzero permeability and permittivity. However for the parity-odd case they both vanish and 
the electric fields even mix with the magnetic fields, which is why the ordinary definition of the 
refractive index does not seem to be reasonable. A further origin of the issue may be that Okubo’s 
method does not produce Einsler structures in a unique manner. We conclude that it may be 
problematic to treat the parity-odd case of modified Maxwell theory with the eikonal equation. 
Einding a solution to this clash is an interesting open problem. 


V. GRAVITATIONAL BACKGROUNDS 

The physics for a classical point-particle equivalent to a massive fermion rests on its Lagrangian. 
The procedure of deriving those within the framework of the SME works for massive particles only 
where in the limit of a vanishing particle mass the Lagrangian goes to zero. So far we have 
demonstrated that the important quantity to describe the physics of electromagnetic waves in the 
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geometric-optics approximation is the refractive index. The reason is that the motion of photons 
is much more restricted than the motion of a massive particle. After all, for a particle with mass 
moving in a potential the initial position, direction, and velocity can be chosen freely. On the 
contrary, for a photon the initial position and direction only are not fixed, whereas its initial speed 
is determined by the refractive index at its starting point. 

In the previous sections it was shown how to establish connections between various cases of 
the minimal SME photon sector and certain Finsler geometries. The Finsler geometries found 
were discovered to be closely related to the various refractive indices where only for the parity-odd 
case of the CPT-even sector such a connection is not manifest. The refractive indices found are 
independent of the spacetime position such as the controlling coefficients, which corresponds to 
the analogue of a homogeneous medium in optics. However the refractive index can depend on 
the three spatial velocity components. In other words, in such cases the refractive index depends 
on angles enclosed between the propagation direction and preferred directions. This situation 
is reminiscent of anisotropic media in optics. Hence Finsler structures related to the CPT-even 
photon sector are three-dimensional in contrast to the Finsler structures obtained from Wick- 
rotating classical Lagrangians of massive particles. Besides, note that in the photon case no Wick 
rotation is necessary, since the intrinsic metric involved is already of Euclidean signature. 

These results shall serve as a base to study light rays in the geometric-optics approximation in 
the presence of Lorentz violation. As we saw, for most cases these can be described by the eikonal 


equation, cf. Eq. (4.4b): 


ds 


n 


ds 


= VxU, 


(5.1) 


where n is the refractive index of the medium considered. On the right-hand side the gradient is 
understood to be computed with respect to the position vector x. The photon trajectory is given 
by X = x(s) and it is parameterized by the arc length s. For an isotropic and homogeneous medium 
the refractive index is a mere constant. In this case one immediately sees that the resulting ray 
equation is 


d^x 

ds^ 


= 0 , 


(5.2) 


whose solution is a straight line as expected. For homogeneous, but anisotropic media the refractive 
index depends on at least one angle, n = n('d), where further angles are suppressed for brevity. 
For a straight ray trajectory the angle 'd is fixed by the initial direction and it does not change 
during propagation, i.e., it is not a function of s. Furthermore due to homogeneity the refractive 
index does not change along the trajectory as well, which is why Vxn('i?) = 0 for points on the 


trajectory. Therefore in this case we again end up with Eq. (5.2). For inhomogeneous media with 


n = n(x) the eikonal equation cannot have straight-line solutions, though. 

In what follows the formalism and knowledge attained shall be applied to propagating light rays 
in curved spacetimes with metric = gp,u{x). The trajectory of a ray in a spacetime is described 
by a four-vector = x^{s) and it propagates with the four-velocity = dx^/ds. Propagation 
occurs along geodesics combined with the nullcone condition = 0 that has to hold locally 

at each spacetime point. For practical reasons, which will become clear in the course of the current 
section, all forthcoming investigations will be performed in a spacetime characterized by a line 
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interval of the form 
dr^ = ^ 


A{r,9,cj>) 


— j4(r, 6, (p){dr^ + r^d9^ + sin^ 9d(j)^). 


(5.3) 


Here t is the time, (r, 9, (j)) are spherical coordinates, and H is a time-independent function. Such 
metrics were proposed in [86] and they are denoted as “generally isotropic” where metrics with A = 
A{r) are called “spherically symmetric.” The parentheses in the spatial part of Eq. (5.3) give the 


volume element of a three-dimensional ball and it is multiplied by A(r, 9, (f>). The choice A{r, 9, 4>) = 


1 in Eq. (5.3) describes Minkowski spacetime in three-dimensional spherical coordinates. In this 


case the spatial coordinate surfaces with constant r are two-spheres. For arbitrary A(r, 9, (p) these 
surfaces are still two-spheres topologically, but their local geometry depends on r, 9, and cp. Note 
that the metric describing a weak gravitational field can be brought into the generally isotropic 
form: 


(5.4) 


(g^u) = diag(^(l + 2$), -(1 - 2d>), -(1 - 24>), -(1 - 2$) 

= diag(^^, -(1 - 2<1>), -(1 - 2$), -(1 - 2ci>)) + ©(ch^) 


Here <I> = <h(r) = —GM/r <C 1 is the Newtonian potential. 

In the latter paper |86| it was shown that there is a link between the eikonal equation of the 
geometric-optics approximation and the null geodesic equations of a spacetime based on a line 


interval of Eq. (5.3). A suitable combination of the geodesic equations leads to 


ds 


Hx 

A{r,9,cP) — 


= VA{r,9,cP), 


(5.5) 


i.e., A{r,9,(p) of Eq. (5.3) can be understood as an inhomogeneous and anisotropic refractive in¬ 


dex. Therefore as long as weak gravitational fields are considered, light behaves according to the 
geometric-optics approximation. The approximation is expected to break down as soon as strong 
gravitational forces appear such as in the direct vicinity of a black hole. In this case the original 
geodesic equations have to be studied instead of the eikonal approach. Note that the converse is 
true as well. If the eikonal equation is known to be valid (also in flat spacetime) this corresponds 


to a propagating ray in a generally isotropic spacetime of Eq. (5.3) 


A. Isotropic case 


The eikonal approach has a great potential to be applied to the propagation of light rays in 
a weak gravitational field permeated by a Lorentz-violating background field. It is reasonable to 
start with the simplest case, which is the isotropic one investigated in Sec. IHIBl With the constant 


refractive index n = 1/J? (in Minkowski spacetime) given by Eq. (3.8) or Eq. (4.13a) the eikonal 
equation and the corresponding spacetime, Eq. (|5.3|), read as follows: 


ds 


1 dx 
ds 


(5.6a) 


dr^ = ^df^ —+ r^d0^ -|- sin^ 9d(p‘^). 


(5.6b) 
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FIG. 1: Deflection of light near a massive body, e.g., the planet Jupiter. (The picture of Jupiter was taken 
by the Cassini spacecraft, cf. http;//solarsystem.nasa. gov/planets/prof ile. cfm?Object=Jupiter,) 


The coordinate surfaces of the associated spacetime are spheres whose radii are scaled by l/\/^. 
This intermediate result can now be used to introduce a gravitational background. Via the principle 
of minimal coupling the flat Minkowski metric is replaced by a curved spacetime metric, i—)• 
g^uix), and the constant refractive index n is promoted to a spacetime-position dependent function: 


n I—>■ n{r, 9, (/>). The curved spacetime metric is taken to be Eq. (5.4) for a weak gravitational field. 
Since the latter is spherically symmetric, it is reasonable to assume spherical symmetry for the 
position-dependent refractive index, i.e., n(r) = l/JK{r). The corresponding eikonal equation and 
the line interval then read as follows: 


ds 


dx' 
n(r) — 
ds 


= ^n{r ), n(r) = 


1 - 2<l>(r) 
Jl{r) 


(5.7a) 


dr2 = 


n{r) 


— n(r)(dr^ -|- r^d0^ -|- sin^ Odcj?). 


(5.7b) 


Hence the minimal-coupling principle amounts to a refractive index that is the product of a spatial 
component of the weak gravitational field metric and the spacetime-position dependent refractive 
index l/^(r) associated to the isotropic Lorentz-violating framework considered. 

The approach introduced has a paramount advantage. The physics of a Lorentz-violating pho¬ 
ton in a (weak) gravity field can be studied without field theory and the geodesic equations in a 
curved spacetime. Instead, a classical method is used replacing photons by light rays and working 
in the geometric-optics approximation with the eikonal equation. In this context Lorentz sym¬ 
metry violation is treated as explicit, which is known to clash with the existence of gravitational 
backgrounds [32] ■ The latter sections VI and VII will be dedicated to this issue where for now we 
will delve into phenomenology. 

One possible application of the used approach lies in the (modified) deflection of light in the 
vicinity of a massive body (cf. Fig. [p, which is an important test of gravitational theories. From 
a technical point of view the eikonal equation is nonlinear, which makes it challenging to solve 
analytically in general. However for the isotropic case, i.e., a refractive index only depending on 
the radial coordinate r the formula of Bouguer follows from the eikonal equation (see, e.g.. Sec. 3.2.1 
of IHHI): 


n(r)r since = C . 


(5.8) 
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Here C is a constant and a the angle between the tangent vector of the trajectory and the radial 
vector pointing from the coordinate origin to a particular point on the trajectory. The latter 
relationship is the equivalent of energy and angular momentum conservation for a massive particle 
in classical mechanics. Since both the distance r of a particular point from the origin and the angle 
a associated to this point does not depend on the parameterization of the trajectory, we choose to 
parameterize it by spherical coordinates. Thereby the problem is restricted to the x- 2 ;-plane with 
0 = 'kI‘1. The trajectory then reads x = re,- where r = r{4>) and = er{4>) is the unit vector 
pointing in radial direction. The angle a is given as follows: 


sm a = 


r{4)) 


dr 


r = 


+ r^(0) ’ d(/) 


(5.9) 


Now the formula of Bouguer delivers a differential equation for ^{r). Its solution is obtained by 
solving the latter with respect to d(/>/dr and by performing a subsequent integration: 


4){r) = C 


dr 


'd ry/n{r)‘^r^ - ’ 


(5.10) 


where d is the distance of minimal proximity and the condition (j){d) = 0 has been set. By doing 
so, solving the eikonal equation has been reduced to computing a one-dimensional integral. Now 
consider a classical light ray approaching a massive body with impact parameter d^o, which is the 
distance between the particle propagation direction in the asymptotically flat region and the parallel 
going through the center of mass of the body (at the coordinate origin). The photon will travel 
such that its distance to the body steadily decreases until reaching a minimum where it increases 
again afterwards. At the minimum distance d we have that r = 0 and therefore a = 7r/2. The 
minimum distance corresponds to the impact parameter to a very good approximation: d ~ doo- 


This is why Eq. (5.8) immediately tells us that 


C = n{d)d Ri n(doo)do 


(5.11) 


Without the massive body the change A(f) in the angle would be equal to tt for a photon coming from 
an asymptotically flat region, passing near the coordinate origin, and propagating back to inhnity. 
Due to the body there is a deflection, which changes A(/> to an angle that is slightly larger than 


TT. Performing the integration in Eq. (5.10) from r = d to infinity gives half of this contribution. 


since it only takes into account the second half of the trajectory. Therefore the deflection angle (p 
is given by 

dr 


ip = Acj) — TT = 2C 


d ry/n{r)‘^r'^ — 


— TT . 


(5.12) 


It can be checked that Eq. (5.12) gives (/? = 0 for n(r) = 1 as expected. Eor a constant refractive 


index n it holds that C = nd. By inspecting Eq. ( |5.12 ) it follows immediately that a constant 
n does not lead to any deflection. This is in contrast to IHZ! where for certain Lorentz-violating 
frameworks with constant Lorentz-violating coefficient it was shown that there is a change in the 
deflection angle caused by Lorentz violation, indeed. However note that in the latter reference 
a Schwarzschild black hole was considered whose line interval had not been cast into generally 


isotropic form, cf. Eq. (5.3). A discussion of this difference leading to more insight into Bouguer’s 


formula is relegated to App. since it is quite technical and probably not of relevance for all 
readers. 
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B. Phenomenology for the isotropic framework 

With the technique further developed, we are ready to carry out phenomenological calculations. 
The goal is to obtain predictions for the change of the light deflection angle caused by particular 
Lorentz-violating frameworks. These predictions will be compared to experiment to obtain sen¬ 
sitivities on controlling coefficients in the minimal SME photon sector. As the most important 
example light deflection at the Sun will be discussed first. However light can be deflected at any 
other massive bodies such as planets. 

First of all we intend to recapitulate the standard result. For vanishing Lorentz violation the 


deflection angle of Eq. (5.12) can be computed analytically. Thereby the integral 2.266 of |89] is 
helpful: 


dx 


J x\/a + /3x + 

For the Lorentz-invariant case we have 


1 . / 2a + /lx 

: - arcsm - 

—« yxyZ/l^ —day 


a < 0, /3^ — day > 0 , 


d f d „ 


/1 = 2, y = l, 


(5.13) 


(5.1d) 


with the Schwarzschild radius Rg = 2GM/(P‘ of the massive body. Here G is the gravitational 
constant, M the mass of the body, and c the speed of light. The conditions for a, j3, and y stated 


in Eq. (5.13) are fulfilled and the full analytical result for the deflection angle is given as follows: 

l + 2e ^ 


^ = 




TT -|- 2 arcsin 


2^ 

1 + 2 ^ 


- TT = d^ + ©(e") 


^ 2d ’ 


(5.15) 


where the latter is the first-order expansion in the dimensionless parameter ^ <C 1. Now considering 
a light ray directly passing the surface of the Sun (scraping incidence), d is given by the radius Tq 
of the Sun. Using the values of Tab. [I] and multiplying the previous equation with 180 • bO^/vr leads 
to the well-known result ~ 1.75'^ which lies within few standard deviations from the mean value 
observed during the total eclipse in 1919 [9T1|92]. 


Now the refractive index is modified due to Lorentz violation according to Eq. (5.7a). Therefore 


the isotropic Lorentz-violating coefficient Ktr is promoted to a spacetime-position dependent func¬ 
tion (cf. Fig. [^. It is assumed to only depend on the radial coordinate r to keep the framework 
isotropic: 


Ktr Ktrir) = Ktr [1 “ fir)] , 


(5.16) 


Quantity 

Unit 

Value 

G 

m^/(kg • s^) 

6.67384 • 10-11 

Mq 

kg 

1.98910 • lO^o 


kg 

1.89813 • lO^i' 


kg 

5.68319 • 1026 

^0 

m 

6.95508 • 10® 


m 

6.99110- lOi” 


m 

5.82320 • lOi' 


TABLE I: Gravitational constant, masses, and radii of the Sun (©), Jupiter (T), and Saturn (b). For Jupiter 
and Saturn the average of the pole and equatorial radii is used. 
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FIG. 2: Spacetime-position dependent refractive index (and controlling coefficient) as a function of the 
dimensionless parameter r jd where d corresponds to the Sun radius Tq in this example. (The picture of the 
Sun was taken by SOHO - EIT Consortium, ESA, NASA, cf. http://science.nasa.gov/science-news/ 

science-at-nasa/2003/22apr_currentsheet ) 


with a function / having special properties. The latter shall be constructed such that 1 — / > 0 
for r/d > 1. This means that the sign of Ktr(^) is fixed by the sign of the constant prefactor Ktr- 
Furthermore limri_!.oo Ktr(r) = Ktrj whereby in the asymptotically flat region the position-dependent 
controlling coefficient is identified with the corresponding SME photon coefficient Ktr in Minkowski 
spacetime. The refractive index then reads as 


n(r) = 


-t- Ktr(r) 


1 + ^ 2 ) 

r J 


1 - Ktr(r) 

From Eq. (5.7b) the coordinate velocity of light in this framework is given by 
|dr| 1 


c = 


dt 


n{r) ’ 


(5.17) 


(5.18) 


i.e., for Ktr > 0 it is reduced in comparison to the Lorentz-invariant case. The position dependence 
shall reflect the properties of the gravitational background. The curvature radius Rs is the physical 
scale of the background, i.e., it is reasonable to associate it with Ktr(?') as well. Note that we are 
only interested in the behavior of the function outside of the massive body, which means r > d. 
Generic functions with these properties are 


fir) 


1 u 


r — d\ 


2 


-1 


(5.19a) 


2arctan {a [l — (r — d)^/i?|] } vr 
2 arctan(a) -|- vr 

where a < 1 is a free, dimensionless parameter. Therefore for these particular sample functions 
it holds that f{d) = 1 and limj-^^oo fir) = 0. Whatever the underlying theory for a possible 
violation of Lorentz invariance looks like, it is reasonable to assume that the amount of Lorentz 
violation is influenced by a gravitational background field. Referring to a small-scale structure 
of spacetime where simple models were shown to produce Lorentz-violating particle dispersion 
relations nails] the argument could be along the following lines. A gravitational field has an 
energy density associated to it, cf. [DOj for the case of spheres and black holes. Since a spacetime 
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foam is caused by energy fluctuations, an additional contribution of energy density associated to 
a gravitational field may have some influence on it. This would render the effective controlling 
coefficients for Lorentz violation spacetime-position dependent. Hence for the isotropic framework 
considered the refractive index directly at the surface of the Sun may have a dip for Ktr > 0 or a 
peak for Ktr < 0 in its position dependence (cf. Fig. [^. As long as the underlying description is 
not available, it is challenging to deliver a more rigorous argumentation. Hence a Ktr(?') including 


Eq. (5.19) with the parameter a controlling the width of the dip/peak must be interpreted as a 
phenomenological description of such effects. 

Now the modified deflection angle can be calculated in two different ways. The first is to 


compute the integral according to Bouguer’s formula of Eq. (5.12). The second is to solve the 


eikonal equation directly. In App. 0 the eikonal equation is brought into a form that is suitable 


for solving it. For a refractive index that has a radial dependence only, Eq. (C.5b) results in 


0 = (r^ -|- -|- n(r^ -|- 2r^ — rr) 

or 


(5.20) 


Bouguer’s formula is a first integral of the eikonal equation that follows from angular momentum 
conservation. Therefore using it allows us to avoid the computation of one integral. Nevertheless 
as a cross check it is reasonable to carry out the computation with the two techniques. Both 


the integral of Eq. (5.12) and the eikonal equation are challenging to be solved analytically for a 


refractive index that is modified by Lorentz violation. Therefore we attempt to treat both cases 
numerically with Mathematic a. 

To gain some physical understanding, the eikonal equation is solved numerically for hypothetical 


values of Rs and Ktr first. At the distance of minimal proximity d the sample functions of Eq. (5.19) 


vanish by construction. Therefore n{d) = 1 -|- Rs/d, which is why C = n{d)d and the impact 
parameter is given by 


don — 


C 


' 1 - Ktr 


n{r = oo) y 1 + Ktr 


{d + Rs) 


(5.21) 


Eor realistic situations, i.e., light bending at stars the Schwarzschild radius is much smaller than the 
distance of minimal proximity. Note that for scraping incidence, d corresponds to the radius of the 
star. Since bounds on the isotropic coefficient Ktr in flat, asymptotic spacetime are strict, it holds 
that d ~ doo to a good approximation. For the hypothetical values that we choose for illustration 
purposes this is not necessarily warranted. Taking d/Rs = 5 and Ktr = 1/20 we obtain the results 
depicted in Fig. The curves show the solutions of the eikonal equation where the refractive 


index has been modeled according to Eq. (5.17) using the sample function /(r) of Eq. (5.19a) for 


different choices of the parameter a. Recall that the latter characterizes the width of the dip/peak 
in the refractive index directly at the snrface of the massive body, which is caused by Lorentz 
violation. Since for comparison all curves should meet at a single point, the impact parameters doo 
have to be adapted properly, which is why they differ from each other. 

The observation is that for increasing a and Ktv > 0 the deflection angle is reduced. As long 
as the light ray is far away from the massive body it experiences a refractive index that increases 
when the distance to the body decreases. This is the standard behavior of the refractive index 
whose origin lies in nonvanishing Riemann curvature components. Upon approaching the massive 
body the light ray suddenly experiences the dip where the refractive index becomes smaller for 
decreasing distance. The ray then behaves contrary to the standard case and tends to be bent 
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(b) 


FIG. 3: Solutions of the eikonal equation in the cc-y-plane (in dimensions of d) for kh = 1/20 (a) and for 
= -1/20 [(^ The massive body resides in the origin and the hypothetical value d/Rs = 5 has been 
chosen. The black (plain) curve shows standard light deflection where for the remaining {red (dotted), blue 
(dashed), green (dashed-dotted), orange (dashed with large spaces)} curve a = {1,1/10,1/10^,1/10^} has 

been taken successively. 


away from the body, which can be clearly seen in Fig. 3a Note that for Ktr < 0 the dip in the 
refractive index turns into a peak. Hence the behavior is opposite and the ray is bent towards the 


body even stronger, cf. Fig. 3b 


From a technical point of view to solve the eikonal equation, proper initial conditions have to be 
considered. Since the ray is assumed to arrive from an asymptotically flat region, the initial angle is 
(po = TT. In practice an angle lying close to vr must be chosen where the direction of the ray initially is 
assumed to point along the positive horizontal axis. We express the solution of the eikonal equation 
as r{p) = d^{4>) with the dimensionless function The initial conditions are then fixed to be 

^(0o) = ^ and P,\4>o) = — cot((/)o)A where po = tt — arcsin(doo/A). Here A is a length scale with 
the property A ^ d, which is tuned to increase the precision of the numerical result. Theoretically 
A should approach infinity, which is not a possible value to choose in practice, though. Setting the 
final angle in the numerical integration to (/)i < 0 leads to numerical instabilities, which is why pi 
is taken to be slightly larger than zero. This is supposed to be sufficient for small bending angles 
that appear in realistic scenarios. It is reasonable to set both the working precision to a large 
number and the maximum number of steps to infinity. 

There are at least two space-based missions available that could test gravity based on light 
deflection. Two of the most promising ones are GAIA and LATOR. In what follows we will discuss 
the perspective of these missions in obtaining constraints on Lorentz violation in the (isotropic) 
photon sector by performing measurements of light deflection at massive bodies. Thereby the 
theoretical tools developed so far will be of great use. 
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1. Sensitivity of GAIA 


GAIA^ IMj is a space probe that was launched in December 2013 by ESA. The mission goal 
is to perform measurements of positions and radial velocities of about 1% of the galactic stellar 
population, which shall generate a three-dimensional map of our galaxy. This is supposed to give 
information on the galactic history, dark matter as well as extra-solar planetary systems. GAIA 
can measure angles with a sensitivity of around 10 pares, which is why it can test deflection of 
light at massive bodies to a high precision. However the mission parameters do not allow light to 
be measured grazing the surface of the Sun. Such measurements will be possible for Jupiter and 
Saturn only (see Table III in [9l]l. 

Now we intend to perform phenomenology of light bending in an isotropic Lorentz-violating 
framework based on the possibilities of GAIA. Thereby sample functions are taken according to 


Eq. (5.19) with different values for the parameter a = 1/10* and the range i = 0... 15. Ghoosing 


a particular controlling coefficient Ktr, the deflection angle of light in the vicinity of Jupiter is 


computed with two methods. The first uses the formula of Bouguer, Eq. (5.12). The second solves 


the eikonal equation (5.20) numerically in analogy to what was described above. The bending 


angle is then computed via the scalar product of the initial and final normalized tangent vectors. 
This gives an excellent cross check for the results, since the two methods are independent from 
each other. 

The bending angle obtained is then compared to the standard result. This procedure is repeated 
for a decreasing isotropic coefficient Ktr until the difference between the modified and the standard 
result approximately matches the precision that GAIA can measure angles with. This sets the 
sensitivity of the experiment with respect to Ktr in a curved background. However it is challenging 
to compute the integral or to solve the eikonal equation with a high precision. We use the difference 
of the results obtained from the two methods as a measure for how meaningful they are. Eor a 
conservative estimate of the sensitivity one should keep results only if this theoretical uncertainty 


-logio(a) 

-logio(Ktr) 

fir) 

pXr ~ [10 pares] 

sir) 

p\ — p% [10 pares] 

0 


14 

1.61 

1.40 

1.. 

4 

13 

9.02; 5.07; 2.85; 1.60 

8.80; 4.97; 2.80; 1.57 

5.. 

8 

12 

9.02; 5.07; 2.85; 1.60 

8.84; 4.97; 2.80; 1.57 

9... 

12 

11 

9.02; 5.07; 2.84; 1.59 

8.84; 4.97; 2.79; 1.56 

13.. 

15 

10 

8.76; 4.67; 2.32 

8.58; 4.57; 2.26 


TABLE II: Numerical results for modified light deflection angles at Jupiter (scraping incidence) in the 
isotropic framework. The first column states (ranges of) the parameter a used in Eq. ( |5.19[ ). The second 
column gives the value of the isotropic coefficient. In the third and fourth columns differences between the 
standard deflection angle « 16.8 marcs and the modified angle are shown in suitable units. For the 
third column the modeling function f(r) of Eq. (5.19a) is used and for the fourth column we employ g{r) 
of Eq. (5.19b). Each number is understood to be associated to the proper parameter a in the first column 
where both lists are in order. The results for a negative Ktr are equal to the corresponding numbers for a 
positive Ktr where the global sign of — p% is reversed. The deviation in the absolute values shows up in 

the third digit for almost all differences. 


^ The acronym originally meant “Global Astrometric Interferometer for Astrophysics.” Although the foreseen mea¬ 
surement technique was changed upon construction of the apparatus, the acronym was kept. 
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is much smaller than the difference between the modified and the standard bending angle. 

First of all for Ktr > 0 the difference between the standard bending angle and the modified 
bending angle is positive, which shows that the bending angle is reduced by a positive Lorentz- 
violating coefficient Ktv (see the third column of Tab. m- For Ktr < 0 the behavior is vice versa and 
the absolute numbers mainly deviate in the third digit, which is why they are omitted in the table. 
We stated all differences (p* — ip% that are larger than and lie in the vicinity of the experimental 
precision of GAIA, i.e., 10 pares. Such modifications can be expected to be detectable by this 
mission. From the results it becomes clear that the sensitivity of the isotropic coefficient reduces 
when the width of the dip, which is controlled by the parameter a, decreases. If the width lies in 
the order of magnitude of Jupiter’s radius the sensitivity for |Ktr| is 10“^^. In case the width lies 
15 orders of magnitude below that the sensitivity of |Ktr| is still 10“^^. Hence the sensitivity does 
not decrease as quickly as the parameter a. The numbers are meaningful, since the difference of 
the results obtained with Bouguer’s formula and by solving the eikonal equation directly is around 
4 X 10“^^ pares at the maximum. The latter is interpreted as the theoretical uncertainty and it is 
much smaller than |(/?* — 

Obtaining the modified deflection angles for Saturn works completely analogously. The sensi¬ 
tivity on Ktr lies in the same order of magnitude. The only difference is that even smaller a could 


be probed based on a modeling according to Eq. (5.19). The reason is that 




2.78- 


d% 


Rsfy Rs,% 

whereby the additional dimensionless factor increases the contribution of a. 


(5.22) 


2. Sensitivity of LATOR 


LATOR (Laser Astrometric Test of Relativity) [95) |96] is a mission that is being planned by 
a collaboration of NASA and ESA. It is a Michelson-Morley-type experiment that shall perform 
curvature measurements in our solar system to determine the Eddington post-Newtonian parameter 
7 with a precision of 1 part in 10®. It is considered to be a test mission for General Relativity and 
it is supposed to detect the frame-dragging effect and to determine the solar quadrupole moment. 
The primary objective will be to measure the gravitational deflection of light by the Sun to an 
accuracy of 0.02 pares. Such an astounding precision shall be made possible by an improved laser 
ranging and a long-baseline optical interferometry system. 

We carry out phenomenology as we did before by choosing different parameters a for the sam¬ 
ple functions of Eq. (5.19). The calculations are completely analogous to before where the only 


difference is that they are carried out for the Sun using the appropriate parameters of Tab. [H The 
essential numerical results are stated in Tab. imi The bending angle behaves similarly to before, 
i.e., it is reduced for Ktr > 0 and it increases for Ktr < 0 . The differences are listed that lie 

in the vicinity of the experimental precision expected for LATOR, i.e., 0.02 pares. If the width of 
the dip/peak in the refractive index of the Lorentz-violating vacuum lies in the order of magnitude 
of Sun’s radius the sensitivity for the isotropic coefficient |Ktr| is 10“^®. The lowest sensitivity 
in case of a very narrow dip/peak is 10“^^. Comparing the results determined from Bouguer’s 
formula to the results from the numerical solution of the eikonal equation reveals differences of ca. 
2 X 10 “^ pares. Therefore the theoretical uncertainty is still much smaller than \p* — Pq\. Note 
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that for the model function g{r) the modihcation of the deflection angle for |Ktr| = 10“^® is smaller 
than 1.50 x 10“^ jrarcs. Therefore assuming this model function, the sensitivity of LATOR will not 
be sufficient to detect a l^trl lying in the order of magnitude of 10“^®. 


3. Discussion 

According to the current (2015) version of the data tables [5] the strictest lower bounds on Ktr he 
in the order of magnitude of —10“^® where the best upper bounds are around 10“^^. The isotropic 
coefficient of modified Maxwell theory is challenging to be constrained in laboratory experiments, 
which is why these bounds are related to ultra-high energy cosmic rays. With the precision of LA- 
TOR there would be a space-based experiment performed under controlled conditions that could 
have a sensitivity comparable to the best current constraints on a negative Ktr- This is astonish¬ 
ing taking into account that the precision of a man-made experiment may match the sensitivity 
reached by the most energetic particles propagating through interstellar space for distances of many 
lightyears. It illustrates the versatility of the technique presented to constrain Lorentz violation in 
the photon sector by precise measurements of light bending at massive bodies. Note that the sen¬ 
sitivity does not largely depend on the model function used. This independence could be checked 
for further model functions, which can be regarded as an interesting future project. 


C. Anisotropic (nonbirefringent) case 


The anisotropic case of modified Maxwell theory exhibiting a single modified dispersion relation 
was discussed in Sec. nirni This particular case is characterized by a preferred spacelike direction 
(chosen to point along the positive z-axis) and one controlling coefficient. The refractive index was 


found in Eq. (4.13b) and it was expressed in terms of the angle 'd enclosed between the propagation 


direction and the preferred axis. The possible trajectory of a light ray is parameterized by r{(f>) = 
r{4>)e^ such as for the isotropic case. The angle "d in the refractive index is then given by the scalar 
product of the tangent vector t and the preferred direction ^ where it is sufficient to work in two 


-logic (a) 

- logic («^tr) 

fir) 

- Vq [10"^ pares] 

gir) 

‘P© - 7^0 [10"^ pares] 

0 

16 

1.57 

— 

(0)1...4 

15 

8.84; 4.97; 2.80; 1.57 

13.6; 8.58; 4.84; 2.72; 1.53 

5...8 

14 

8.84; 4.97; 2.79; 1.56 

8.61; 4.84; 2.72; 1.52 

9... 11 

13 

8.57; 4.56; 2.26 

8.35; 4.43; 2.18 

12... 13 

12 

9.99; 3.91 

9.49; 3.64 

14...15 

11 

13.9; 4.64 

12.7; 4.21 


TABLE III: Numerical results for modified light deflection at the Sun in the isotropic framework (see 
Tab. HD for the meaning of each column). The standard deflection angle for scraping incidence at the Sun is 
(fQ « 1.75arcs (see Eq. ( 5.15| ) and the subsequent paragraph). In the second line a = 0 is associated to the 

flrst value in the fourth column. 
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FIG. 4: Solution of the eikonal equation in the x-y-plane (in dimensions of d) with d/Rs = 5. The blue 
(plain) curve shows the solution for the Lorentz-invariant case, whereas the red (dashed) curve depicts the 
solution for the anisotropic case with = 0.65. The massive body resides in the coordinate center. 


spatial dimensions: 


r{4>) cos 4> + r((j)) sin (f> 

|t| i/r((/))2 + r((/))2 


(5.23) 


Note that for the anisotropic case angular momentum is not conserved and Bouguer’s formula loses 
its meaning. Hence there does not seem to be an alternative to solving the eikonal equation directly, 
which is carried out numerically for hypothetical values of Rs and the controlling coefficient 
The results are shown in Fig. In contrast to the isotropic case, cf. Fig. where the trajectory is 
not modified for a spacetime position independent Ktr this is not the case here. For the anisotropic 
sector the shape of the trajectory gets distorted where the final impact parameter decreases for 
Kgl > 0. Physically this means that the ray loses angular momentum. An interesting future 
research project would be to perform a similar kind of phenomenological analysis as we did for the 
isotropic case. 


VI. MODIFIED ENERGY-MOMENTUM CONSERVATION 

The phenomenology in the previous section was carried out in an explicitly Lorentz-violating 
framework, which is known to cause tensions in a gravitational background |39]. The purpose of 
the current section is to investigate where exactly these problems occur in our classical description 
and how they can be interpreted from the point of view of an inhomogeneous medium. There¬ 
fore the energy-momentum tensor and its conservation law will be derived for the isotropic case. 
The (Belinfante-Rosenfeld) energy-momentum tensor follows from varying the corresponding La- 
grangian with respect to the metric. The Finsler structure T(u)|@ of Eq. (3.14b) is the equivalent 


to a Lagrangian, since it appears as the integrand of the path length functional that is stationary 
for the trajectory travelled by the light ray. Instating an auxiliary metric tensor leads to the 
following result: 


F = nlul = 


1 + 

1 - Ktr -IppaC^C 




(6.24) 
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Note that in Minkowski spacetime it holds that = 1 and where the 

minus sign in the latter term is due to the signature of the metric chosen. Variation has to be 
carried out for all independent degrees of freedom. A useful formula is 

= 2A^5A^ + SiP^.uA^^A'^, (6.25) 

which states the variation of a scalar product of helds. Employing this rule, the variation of F can 
then be computed as follows: 



Now everything is available to obtain the energy-momentum tensor from 5F by considering all 
terms comprising a variation of the metric. An additional prefactor containing the metric has to 
be taken into account in the dehnition. However we are interested in the covariant conservation law 
of for Minkowski spacetime, i.e., for a spacetime-position dependent refractive index without 
an additional gravitational field. In this case whereby 


2 5{^\F) 

\/W\ 




n{l - KtrY 


- 


= nVuM -{n^ -y - 


(6.27) 


Here '0 = det(V'^;y) and [u^] = (0, u)^, i.e., involves the spatial velocity and its zeroth component 
vanishes. Upon inspection of the latter result we see that the 00-component of is made up by 
the preferred timelike spacetime direction and it vanishes for n = 1, i.e., in a Lorentz-invariant 
vacuum. The spatial part solely comprises products of three-velocity components and the mixed 
components vanish. Now the partial derivative of the energy-momentum tensor in Minkowski 
spacetime leads to: 


= ^ I (sn^ + eC - . (6.28) 
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An interesting observation is that the timelike contribution can be expressed in terms of the metric 


appearing in Eq. (5.7b): 


(9 


S'lM _ 


1 


— 3n H—^ , 




giiuir) = diag ( , -n(r), -n(r), -n(r) ) . 

Jt,u 


(6.29a) 


(6.29b) 


Note that gfj,^ is not associated to a gravity field but only to a nonconstant refractive index. The 


result obtained in Eq. (6.28) describes the conservation of energy and momentum of a light ray. Its 


properties are in order. First, it vanishes for a constant refractive index, i.e., energy and momentum 
of the ray are conserved in a homogeneous medium, in the Lorentz-invariant vacuum, and a Lorentz- 
violating vacuum with a constant controlling coefficient. Second, in an inhomogeneous medium or in 
a Lorentz-violating vacuum with spacetime-dependent controlling coefficient the energy-momentum 
tensor is not conserved, since the partial derivative of the refractive index does not vanish in this 
case. As long as the refractive index is not time-dependent, don = 0, which is why = 0. 

Since the case under consideration is isotropic, the controlling coefficient and the refractive index, 
respectively, can only depend on the radial coordinate: Ktr = Ktr(r), n = n(r). Hence d^n has 
a nonvanishing component along the radial basis vector only, i.e., drU ^ 0 and dgn = d^n = 0 . 
Decomposing the spatial velocity into a radial part and transverse components 


U = Sr + U^eg + 


(6.30) 


the spatial part of the conservation law reads as 

dpT^^ = - ^={'Vn ■ u)u* = — \/u2(Vn • u)m* = —V^{drn)ifu‘ , u = -^ . (6.31) 

VU^ |u| 

Several observations can be made upon inspecting the result. For a constant refractive index the 
right-hand side of the latter equation is zero, which means that the spatial part of the energy- 
momentum conservation law is valid as well in this case. For drU 7 ^ 0 it even holds when the radial 
velocity component vanishes: u'" = 0. This is a special situation that can occur for a light ray in an 
inhomogeneous, isotropic medium whose refractive index has a particular r-dependence and when 
the ray is emitted tangentially to a circle with its center lying in the coordinate origin (cf. |97| 
for a beautiful paper on geometric-ray optics and its implications for certain optical systems). 
The trajectory of the ray is then a circle where the refractive index is constant. The magnitude 
of the three-momentum vector does not change, but only its direction. So momentum is not 
exchanged between the light ray and the medium, because any momentum transfer would change 
the magnitude of the momentum vector. For any other case with nonzero drU momentum has to 
be exchanged, which is why of the ray cannot be conserved. The net term obtained above 
points in the direction u of the ray at the point considered. However the total energy-momentum 
tensor with of the medium included is expected to be conserved, because any momentum 
change of the light ray will cause a momentum change of the medium itself. 

In General Relativity local diffeomorphism invariance is tightly linked to energy-momentum 
conservation. In [39] it was shown that explicit Lorentz violation in gravity leads to a loss of diffeo¬ 
morphism invariance, which then causes the energy-momentum tensor to be no longer covariantly 
conserved. Note that in the latter reference the energy-momentum tensor Te'^ is considered that 
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follows by varying the Lagrangian with respect to the vierbein instead of with respect to the met¬ 
ric tensor. It is different from the Belinfante-Rosenfeld energy-momentum tensor considered here 
even in case there is no Lorentz violation |98) . The covariant derivative of in [39j involves the 
covariant derivative of the Lorentz-violating controlling coefficients, i.e., a term of the structure 
J^Dykx- Here kx is a generic controlling coefficient with a particular Lorentz index structure x 
contracted with an appropriate operator . For general curved manifolds there is no spacetime- 
position dependent function satisfying Dykx = 0, but only for parallelizable manifolds such as the 
circle or the two-torus = 5^ x . In four dimensions such manifolds are rare, though, and 
they do not seem to be of particular interest in the context of General Relativity. 

Note that the conservation law without gravitational fields is given by i9^(0c)^^ = J^d'^kx with 
the canonical energy-momentum tensor (0c)^^ |39j . Therefore if the controlling coefficient kx is 
dependent on spacetime position the conservation law is modified even in flat spacetime. From 
a physical perspective this is not surprising, since such a controlling coefficient implies that the 
vacuum behaves like an effective, inhomogeneous medium. In general the magnitude of the three- 
momentum of a light ray is not conserved as was argued above. Therefore momentum has to 
be exchanged between the ray and the medium. When considering explicit Lorentz violation the 
effective medium is considered to be nondynamical, which is why it can neither absorb nor deliver 
momentum to the light ray. 

Interestingly the situation is different when spontaneous violations of diffeomorphism invariance 
and local Lorentz symmetry are considered. In these cases the ground state violates these symme¬ 
tries dynamically by an emergent vacuum expectation value of a vector or tensor field in a potential 
la ESI SQl [M56]. Such models have in common that they involve massless (Nambu-Goldstone) 
modes where the latter appear when any global, continuous symmetry is broken spontaneously. 
When the symmetry is local there can be an additional Higgs-type mechanism absorbing the mass¬ 
less modes to produce massive gauge fields. Since for spontaneous Lorentz violation the dynamics 
of the Lorentz-violating background field is taken into account, the energy-momentum conservation 
law is restored in these theories. In the corresponding equation there is no contribution J^d^kx- 

From the perspective of an inhomogeneous medium translational and rotational symmetry are 
violated spontaneously by the atomic lattice. The Nambu-Goldstone modes (gapless excitations) 
linked to the spontaneous violation of these symmetries are the two transverse and the longitudinal 
types of phononsj^ Since the medium is now dynamical, it can absorb momentum from the ray 
upon producing phonons. Hence the conservation law for the light ray remains valid in this case. 

VII. PROPERTIES OF THE ISOTROPIC FINSLER SPACE 

In the previous section the modihed conservation law for the Belinfante-Rosenfeld energy- 
momentum tensor for a light ray in an isotropic, inhomogeneous medium was considered and 
discussed. It was found that the energy-momentum tensor is not conserved, since the nontrivial 
medium is nondynamical corresponding to a background violating Lorentz symmetry explicitly. 
This issue persists even in Minkowski spacetime, since in inhomogeneous media light rays behave 
similarly to when they propagate in gravitational backgrounds. The most prominent example for 
a common effect is light bending. 

® The number of spontaneously broken translational and rotational generators is six. However as they are not 
independent from each other, the total number of Nambu-Goldstone bosons is reduced by these constraints to be 
three. Relations between broken symmetries and Nambu-Goldstone bosons in certain nonrelativistic systems such 
as crystals, ferromagnets, and superfluids are nicely described in [99]. 
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The conservation law of the energy-momentum tensor for a medium with spherically symmetric 


refractive index, Eq. (6.31), involves both the first derivative of the refractive index and the prop¬ 


agation direction of the ray at a given point. Recall that in gravitational theories with explicit 
Lorentz violation nonconservation of the energy-momentum tensor clashes with the Bianchi iden¬ 
tities of Riemannian geometry |3S]- Although we work with an effective Lorentz-violating theory 
for light rays based on a nontrivial refractive index, this issue can be encountered here as well. 

The purpose of the current section is to figure out whether explicit (isotropic) Lorentz violation 
can be considered in a weak gravity field in the framework of Finsler geometry such that no 


inconsistencies arise. As a basis we use the spacetime metric of Eq. (5.7b), which was shown to 


be closely linked to the isotropic case. The properties of the latter metric shall be studied from a 
Finslerian point of view where we use the conventions of [59] for all geometrical quantities. The 


latter are treated based on the indefinite signature of the metric in Eq. (5.7b). As a starting point 


an appropriate Einsler structure has to be constructed whose derived metric should correspond to 


Eq. (5.7b). This works for the following choice: 


F{r,y) = \/y^, =-{y^f - n{y''f - n sin^ 0 

n L 


(7.1) 


where the refractive index solely has a radial dependence. In what follows we write n(r) = n for 
brevity, i.e., the argument of the refractive index will be omitted. The vector y G EM is expressed 
in spherical polar coordinates as y = y^et+y^er+y^eg+y’^^e^j, with suitable basis vectors. The spatial 
part of the Finsler structure is written in terms of spherical polar coordinates. The corresponding 
Finsler metric is then computed according to the usual definition and it corresponds to the result 


of Eq. (5.7b) (with the spatial part transformed to spherical coordinates): 
1 ^ 2^2 


9^il' 


2 dyt^dy 

The inverse metric simply reads as 
= diagfn,--,-^,- 


= diag ( —, —n, —nr^, —nr'^ sin2 9 


(7.2a) 


flV 




1 




n 


nr^ ’ nr^ sin2 9 




Since the metric does not depend on y, the Cartan connection [59] vanishes: 




Fdg, 


flU 


^3^2 


2 dy^ 4 dy^dy’^dy^ 


(7.2b) 


(7.3) 


Therefore according to Deicke’s theorem m the Finsler space considered is Riemannian. Now the 


base has been set up to study the geometry of the space defined by Eq. (7.1). The first step is the 


obtain the coefficients of the affine connection (Christoffel symbols of second kind) that are defined 
in analogy to the Christoffel symbols in Riemannian geometry: 


— 

/ vp 


1 


pa. 


dgau dg, 


up ^ dgpa 


dxP dx°‘ dx 


(7.4) 


Note that summation over equal indices is understood based on Einstein’s convention. The nonzero 
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contributions read as follows: 


j n' ^ n' j. n' j. r(2n + rn') 

^ 


r(2n + rn') . n 

= -— *■“ ' 


1 Tlf 

7^re = - + ^, 7% =-sm0cos0, 


(7.5a) 


(7.5b) 



1 n' 

~ 9~ ’ 

r 2n 



cot 0. 


(7.5c) 


Since torsion is assumed to vanish, the Christoffel symbols are symmetric in the latter two indices. 
The connection coefficients with at least one index equal to the radial coordinate r involve the 
first derivative of the refractive index. Furthermore they do not involve the angle </> as expected 
for spherically symmetric metrics. As a next step the geodesic spray coefficients are needed: 
= 'y^ueU'^y^- The latter appear in the geodesic equations in Finsler geometry: 
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= -yV -, 

n 


(7.6a) 


G"’ = ^ {iy''fn - r(2n + rn')[(/)^ + (/)^ sin^ , 


2 n' 


n 


- (y'^)^sin0cos0, G^ = y^ 


2y^ cot 0 + ?/'’ ( ^ _g 'I 
r n 


(7.6b) 


(7.6c) 


The geodesic spray coefficients can be used to define the nonlinear connection |59] on TM \ {0}: 

2 dy'^ ^ ^ 


The reasons for introducing these connection coefficients is as follows. On the one hand the basis 
vectors djdx^ and d/dy'^ are unsuitable to be chosen as a local basis of TTM, since the djdx^ 
transform in a complicated way. On the other hand if {dx'^,dy'‘} is chosen as a local basis of the 
cotangent bundle T*TM the transformation properties of dy^ are involved. To have the desired 
transformation properties for the basis of the tangent and the cotangent bundle of TM \ {0} the 
following basis vectors can be introduced using the nonlinear connection: 


\ 5x^ ’ dy'^ j ’ 


6 _ d 
5x'^ dx’^ 




d 

^y^^ ’ 


(7.8a) 


dx'', 



5y^ = dy^ + N^^^dx'^. 


(7.8b) 


For the particular case studied here the nonlinear connection coefficients can be comprised in a 
(3 X 3) matrix that reads as 




/ —y’^n'I'n 


—y^n'/n^ 

0 

0 


—y^n'/n 
y'^n'/n 

y®(2/r + n'/n) 
y'^i^jr + n'/n) 


0 0 \ 

—y^r(2n + rvl^jn —y^rslv? 6{2n + rn')/n 
y'^{2lr + n'jn) —y'^sin(20) 

2?/‘^cot0 y''(2/r + n'/n) + 2y® cot 


(7.9) 


To compute directional derivatives of tensor fields on Finsler manifolds, a further connection has 
to be found to define a covariant derivative. It was shown that the pulled-back bundle ti*TM has 
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a linear connection associated to it, which is called the Chern connection T^ug. Explicitly it can 
be obtained from the Finsler metric using the nonlinear connection 


UQ 


= -9^"" 


^9oti/ 

6x^ 

dQau 


UQ ^ ^9qoi 


6x°‘ ' 6x 
/3 dg, 


- N' 
dxQ e dyf^ 


dg, 


UQ 


dg, 


dx°‘ “ dyf^ \ dx'^ dy^ J 

The Chern connection is unique and formally it has the same index structure as the formal Christof- 
fel symbols. The difference to the latter is that the derivative 5/5x^ is used instead of the ordinary 
partial derivative djdx^. However in the particular case studied here, T^^q = I^uq, since the Finsler 
metric 5 ^ 1 / does not depend on the components of y. Finally the Chern connection is needed to 
define a Finslerian version of the Riemann curvature tensor: 


R ^ = 

ga 


5T>^ua 


UQ 


6x^ 

dT^ua 
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' ag ua aa ug ' 


(7.11) 


dx^ ^ dy^ 

Since the Chern connection coefficients correspond to the formal Christoffel symbols and the latter 
are independent of the curvature components correspond to the Riemannian ones. They involve 
an additional derivative of the Christoffel symbols, which is why they comprise second derivatives 
of the refractive index. Explicitly the independent curvature tensor components are stated as 
follows: 
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(7.12a) 

(7.12b) 

(7.12c) 


The components related by symmetries are omitted. Since the Finsler structure of Eq. (7.1) is 
Riemannian according to Deicke’s theorem, we will first use the Riemannian definitions of the 
Ricci tensor = Rff^au and the curvature scalar (Ricci scalar) [Ric. These are denoted by 

calligraphic letters and they follow from suitable contractions of the Riemann curvature tensor. 
The Ricci tensor components with equal indices deliver nonzero contributions only: 


diicfi — 
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In Riemannian geometry the curvature tensor obeys the first and the second Bianchi identities. 
Especially the second one, 
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is important in the context of General Relativity, because it leads to the statement that the Einstein 
tensor is covariantly constant: 


[flic 

= d^G% + = 0 , G^’'^ - —ff 


fliy 


a 


(7.15a) 


which was checked to be valid for the particular metric gfj,^ of Eq. (7.2a). This identity is the reason 
why explicit Lorentz violation is incompatible with Riemannian geometry. Due to the Einstein 
equations it forces the energy-momentum tensor to be covariantly conserved as well, which does not 
necessarily hold when there is a spacetime-dependent background. At this point it is reasonable 
to wonder how Finsler geometry can help us to solve that problem. For the isotropic metric 
considered the identity D^G^u = 0 is inherited from the Riemannian to the Finslerian framework, 
since the Finsler metric of Eq. (7.2a) does not comprise any dependence on Assuming that 

in a general 


Finsler geometry provides the necessary tools to circumvent the no-go theorem of 
explicitly Lorentz-violating setting, then it should also work for the special isotropic case studied 
here. 

One possible approach (there may be others) might be to consider a suitable equivalent of the 
Einstein equations in Finsler geometry. Such an equivalent can be based on an alternative definition 
of the Einstein tensor G^’^ constructed from curvature-related tensors in the Finsler framework. 
These objects will be introduced in what follows. The first is obtained from the curvature tensor 
by contracting the latter with two vectors /F according to 


R, =y_R, y_ 

Q ~ p ^ p 


(7.16) 


Note that this construction does not correspond to the Ricci tensor of Riemannian geometry. In 
particular it is sometimes referred to as the predecessor of flag curvature, which is a generalization 
of sectional curvature in Finsler geometry. For the special case here R^g are the components of a 
(4 X 4) matrix. The trace of this matrix is taken to obtain the generalization of the Ricci scalar in 
Finsler geometry: Ric = Since the explicit expressions for R^g and Ric are complicated and 
not illuminating, they will not be stated explicitly. 

The flag curvature in Finsler geometry is computed similarly to the sectional curvature in 
Riemannian geometry. The latter is defined in a tangent space at a point x of the manifold 
where two arbitrary, linearly independent directions are needed for its computation. The resulting 
quantity only depends on the plane considered, but not on the particular choice of the directions. 
The flag curvature in Finsler geometry carries the same spirit where one direction is chosen to 
correspond to y and the other one, say L, is supposed to be orthogonal to y. These vectors 


are then suitably contracted with the curvature tensor of Eq. (7.11). Note that y and the vector 


orthogonal to it can be considered to span a flag where y is assumed to point along the flag pole. 
This explains the name for the curvature. For an n-dimensional Finsler manifold R is the sum of 
n — 1 flag curvatures. It only depends on r and y, but not on the direction L chosen orthogonal 
to y. 


Although Rg^g of Eq. (7.16) is not understood to be the generalization of the Ricci tensor in 
Finsler geometry, it is still possible to define the latter. The definition (cf. Eq. (7.6.4) in [59]) 
involves both the Finsler structure F and the Finslerian version of the Ricci scalar Ric: 

Id'^iF'^Ric) 

— — - 


2 dyi^dy‘' 


(7.17) 
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For Finsler metrics that are Riemannian, i.e., for the isotropic metric considered in Eq. (7.2a) it 


also holds that Ric^u = 


Hence in our case the Finslerian definition of the Ricci tensor 
corresponds to the Riemannian expression, computing an appropriate trace of the curvature ten¬ 


sor. The expression of Eq. (7.17) can be used to obtain the Ricci scalar in Einsler geometry by 


contracting the Ricci tensor with two vectors y^/F (cf. (7.6.5) in 


yfJ. yU 

Ric = Ricav^ — 

p p 


(7.18) 


It can be shown in general that the latter corresponds to R^g that is obtained from tracing 


Eq. (7.16). This object is distinguished from the Ricci scalar [Ric in a Riemannian setting, which 
follows from tracing the Ricci curvature tensor [Ricf^y, cf. Eq. (7.13c). Note that the quantity of 


Eq. (7.18) is the direct Einslerian equivalent of the Ricci scalar. Since the Einsler metric considered 


is Riemannian, [Ric only involves dependences on r, whereas Ric depends on as well. In general 
and especially here [Ric ^ Ric. 

At this stage there are several possibilities of defining the Einstein tensor in a Einsler 
framework using different combinations of [Ric,iv, [Ric, Ru^, Ric 
been tried: 


uytL, . 11 .^ 1 /, and Ric. The following have 




(7.19a) 


(7.19b) 


(7.19c) 


A reasonable test of whether one of these choices is suitable, requires computing their covariant 
derivatives, i.e., I1^(G^[/)^*^ for i = 1.. .3. The wishful result would be a nonvanishing covariant 
derivative bearing resemblance to the modified covariant conservation law of the energy-momentum 


tensor in Eq. (6.31). This makes sense when we assume that the modified Einstein tensor (in a 


Einslerian framework) is linked to the energy-momentum tensor in an explicitly Lorentz-violating 
theory. The corresponding covariant derivative to be used involves both the nonminimal connection 
and the Chern connection T^i^g being equal to the Christoffel symbols 7 ^ 1 /^ in this case: 




afi 




(7.20) 


M Qyli 

Starting from the Einsler metric of Eq. (|7.2a) there have been up to three derivatives with respect 


to the coordinates involved, which is why in Eq. (7.20) the third derivative of the refractive index 


appears in general. The more of the higher derivatives of a Taylor expansion of n(r) are taken 
into account, the smaller are the structures in changes of n(r) to be resolved. Therefore relying 
on the geometric-optics approximation it is reasonable to consider only the first-order change of 
n(r) incorporated in its first derivative and to neglect the higher-order derivatives, which describe 
small-scale changes of n(r). Within this approximation it makes sense to set n(r) = 1, since 
modifications lead to higher-order contributions. 

Eurthermore the Einsler structure that the isotropic case was identified with is three-dimensional, 
cf. Sec. IIIIBl and it involves spatial velocity components only. Hence y^ can be considered as aux¬ 
iliary and will be set to zero at the end. With this physical input the covariant derivative of each 
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Einstein tensor proposed in Eq. (7.19) can be computed. The final result for the third possibility 
looks rather promising: 


/ 
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where terms of G{n",n"',n '^have been neglected. Using the inverse metric of Eq. (7.2b) 


the second index can be raised. Besides we identify the spatial components of y with the spatial 
components of the physical velocity, i.e., y'’ = w'’, , and y'^ = where the spatial flat 

metric in spherical polar coordinates is given by {rij) = diag(l, r^, sin^ 9). This leads to the final 
result 
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(7.22b) 


with the normalized three-velocity vector u = u/|u|. Comparing the obtained result to Eq. (6.31) 


reveals that the structure of both expressions is very similar. The difference is a global prefactor of 
the form The dimensionful factor of is not surprising. Both the Riemann curvature tensor 

and the (modified) Einstein tensor involve two derivatives, which is why their mass dimensions is 
—2. However the energy-momentum tensor is based on the “Lagrangian” of a classical light ray. 


Eq. (6.24), which is a dimensionless quantity. The discrepancy in mass dimensions is compensated 


by the only dimensionful length scale available, which is r. It seems that an alternative definition 
of the Einstein tensor in the framework of Finsler geometry can compensate for the modified 
energy-momentum conservation law when explicit Lorentz violation is considered. This result is 
interesting and deserves further study, e.g., whether it holds for anisotropic theories as well. 


VIII. CONCLUSIONS AND OUTLOOK 

In this work classical-ray analogues to the photon sector of the minimal Standard-Model Ex¬ 
tension were discussed. It was shown that a nonvanishing photon mass allows for deriving classical 
point-particle Lagrangians in analogy to the fermion sector. However the standard method used 
for the fermion sector does not work any more in case the photon mass vanishes. The reason is 
that a light ray does not have as many degrees of freedom as a massive particle. 

Instead, for the photon sector an alternative technique had to be employed which allowed to 
derive a Lagrangian-type function for a classical ray directly from the modified photon dispersion 
relation. This was carried out for several interesting cases of the minimal, CPT-even photon sector, 
which is characterized by dimensionless controlling coefficients. Subsequently it was shown that the 
results obtained are consistent with the eikonal equation approach that describes the geometric- 
optics limit of an electromagnetic wave. Mathematically the Lagrangian-type functions can be 
interpreted as Finsler structures. In contrast to the fermion sector they only involve the spatial 
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velocity components and they are closely linked to an effective refractive index of the Lorentz- 
violating vacuum. 

It has been known long since that there is a connection between the geodesic equations for a 
light ray in a gravitational background and the eikonal equations. This link is warranted for weak 
gravitational fields at least, e.g., in the solar system. It was crucial to set up a phenomenological 
description of light rays subject to Lorentz violation in a weak gravitational field. This description 
made it possible to obtain sensitivities on the isotropic controlling coefficient Ktr that could be 
probed by the space missions GAIA and LATOR employing measurements of light deflection at 
massive bodies. The upshot is that the planned mission LATOR may have a sensitivity on l^trl in 
the order of magnitude of 10“^® where the running mission GAIA can reach 10“^^. The difference 
in sensitivity originates from the different precision of measuring angles for both missions. 

The final part of the paper was dedicated to investigating the properties of the (isotropic) 
curved spacetime, which the phenomenological studies were based on, from a Finslerian point of 
view. It was demonstrated that in the classical limit (neglecting higher spacetime derivatives of the 
refractive index) an Einstein tensor can be defined that is not subject to the usual Bianchi identities 
in Riemannian geometry. Therefore its covariant derivative is nonzero and it has a form that is 
related to the modified conservation law of the energy-momentum tensor based on the classical 
Lagrangian-type function studied in this context. Hence it seems that Finsler geometry provides 
new geometrical degrees of freedom that can serve as a kind of “buffer” to allow for a momentum 
transfer whenever the momentum of the light ray changes. These geometrical degrees of freedom 
take the role of the Nambu-Goldstone modes appearing when spontaneous Lorentz violation is 
considered. 

To summarize, the current article provides a technique in treating Lorentz-violating photons 
in a curved background in a geometric-optics approximation. As an outlook it will be interesting 
to apply the setup to anisotropic frameworks, first to obtain sensitivities on related controlling 
coefficients and second to study the properties of the underlying Finsler geometry. 
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Appendix A: Classical Lagrangians for massive Lorentz-violating photons 


The first part of the appendix shall briefly demonstrate how to derive the classical Lagrange 


functions in Sec. im from the set of equations (|2.2|), (|2.4l), and (|2.5|). The demonstration will be 


performed for the nonbirefringent, anisotropic case of the CPT-even sector and for a particular 
choice of the CPT-odd framework. The calculation is easier for the CPT-even theory, which is why 
it will be studied first. 


1. CPT-even minimal photon sector 


The base is Eq. (2.19) where for convenience we set (3/2)kI_ = k. For the remaining CPT-even 
cases the procedure works analogously. First of all the modihed dispersion relation for a massive 
photon subject to this particular Lorentz-violating framework reads 


(1 -|- k) (/cq — k\ — k^) — (1 — K)k^ — 




(A.l) 


To obtain the group velocity components it is often reasonable not to solve the dispersion rela¬ 
tion to obtain ko directly, but to differentiate it implicitly with respect to the spatial momentum 
components: 
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For the particular case studied, Eq. (2.4) leads to the following three equations: 
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Evidently only the third one is modified by Lorentz violation mirroring the spatial anisotropy. 
These relations can be solved directly to express the spatial momentum components via ko'- 
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(A.4) 


We can now use Eq. (2.5) and express the spatial momentum components by taking into account 
the previously obtained results of Eq. (A.4): 


L = -{kou^ + kiu^ -b k2U^ + kou^) = ^ 




1 -b K 
1 — K 


3\2 




(A.5) 


The latter is solved with respect to ko giving an expression comprising the (unknown) Lagrange 
function and the four-velocity components: 
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Now all four-momentum components in the dispersion relation can be eliminated via Eq. (A.4) 


and a subsequent insertion of Eq. (A.6): 

1 + K kf 


0 = 


0 = L 


1 . {(1 - ^) [(^°)^ - - (1 + > 
1 — Kj lu ) 

(1 - k )(1 + k ) 


— 


(1 — k) — {u^Y — (u^)^] — (1 + k){u‘^Y 


(A. 7a) 


(A.7b) 


The final equation comprises a polynomial of the Lagrangian whose coefficients depend on four- 
velocity components only. The polynomial must be solved to give L: 
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The result corresponds to Eq. (2.25a). The procedure shown is typically applied to derive classical 


Lagrangians. Four of the five equations are employed to eliminate all four-momentum components 
and to obtain a polynomial equation in L that only comprises the four-velocity. The latter is then 
solved with respect to L finally. 

2. CPT-odd minimal photon sector 


Due to observer Lorentz invariance without a loss of generality = (0)0,0,!)'^ will be 

chosen for the spacelike case. The modified dispersion relation involves an isotropic contribution 
and a second term that does not comprise the momentum component parallel to the preferred 
spacetime direction: 
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The group velocity components are obtained by implicit differentiation of Eq. (A.9) with respect 
to the spatial momentum components: 
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Since the preferred spacetime direction points along the third axis of the coordinate system, the 
first and second group velocity components remain standard where only the third one is modified: 
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(A.ll) 
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Therefore Eq. (2.4) results in 


ko 


u 




h 

ko 


u 




kz{kl - k^) 


ko{kl - k2 - 2771,23) 


u 




(A. 12 ) 


The first two of these relationships allow for writing ki and k 2 in terms of fco- However the third 
equation would lead to a cumbersome third-order polynomial to be solved, which is not a reasonable 


step to take. It is better to insert the first two of Eq. (A.12) into Eq. (2.5) and to solve the latter 


with respect to k^. Then it is possible to express k^ via ko only: 


h = 


1 


{ko [{ur + {uY - - Lu^} ■ 


(A.13) 


Now we can express all spatial momentum components via ko. Hence we can eliminate all of them 


in Eq. (A.9) to obtain an equation that only involves ko- This can be solved to write ko in terms 


of four-velocity components and the Lagrangian where one of the solutions reads 


ko = — 


+ 2mcsy^v^L + 






(A. 14a) 


(A.14b) 


Here > 0 has been assumed for simplicity. The last step is to eliminate all four-momentum 
components in the third of Eq. (A.12) to obtain a polynomial equation for L: 


L'^ + 2mQQ^^ u\L + =0, 

which leads to the Lagrange functions 


= mcs 


'V? — \l 


(A.15) 


(A.16) 


Reinstating the preferred spacetime direction, it is possible to write the latter in the form of 


Eq. (2.14). Using the other solution of ko similar to Eq. (A.14) the Lagrangians with the opposite 


signs are obtained. A computation for < 0 leads to analogous results. Due to observer Lorentz 
invariance the form of the Lagrangian stays the same for general spacelike kAF- 


Appendix B: Light deflection in Schwarzschild spacetimes 

In HZ] it was found that a constant refractive index n 7 ^ 1 due to Lorentz violation leads to 
a change in light deflection. This result is in contrast to what we obtain from Bouguer’s formula 
in Sec. IV Al A rough explanation is that Bouguer’s formula relies on the eikonal equation, which 
is equivalent to the null geodesic equations only for a weak gravitational field. However the latter 
reference is based on a Schwarzschild metric, 

dr^ = (^1 - dt^ - (^1 - dr2-r2(d02 + sin2 0d())2), (B.l) 
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which in this form is not generally isotropic. To get a more profound understanding, consider the 


geodesic equations for a photon in a generally isotropic spacetime of Eq. (5.3) with A = A(r). The 


Christoffel symbols are computed in Riemmanian geometry according to Eq. (7.4) and the geodesic 
equations read 


dx^ dx^ dx^ 

dA ^ dA dA ’ 


(x^) = 


(B.2) 


In what follows, differentiation with respect to the curve parameter A and with respect to r, 
respectively, will be denoted by a dot and a prime. The geodesic equations can then be cast into 
the following form; 


.. A' . 

0 = i -rt , 

A 

0 = f+Tr2- 


A' 


A' 




A' 


'2 A'' 

O = 0+ ( —sin(0) cos{9)4>"^ = 0 , 


'2 A'' 

O = 0+ (- + —)r(/) + 2 cot{6)94> = 0 . 


0 = - Ar'^ - Ar^ 

A 


12 


r+ siiT(0)(/) 


(B.3a) 

(B.3b) 

(B.3c) 

(B.3d) 

(B.3e) 


where the fifth of those is the condition for a null-trajectory. They correspond to the equations 
stated in [86] in case that A is a function of the radial coordinate r only. Now the right-hand side 


of Eq. (B.3a) can be written as the derivative of a conserved quantity that is denoted as Kq in [86] : 


0 = ^ 


d f i 


dX VtI 


^ Kn = 


A ’ 


i = KnA. 


(BA) 


With the choice of 9 = 7r/2 Eq. (B.3c) is fulfilled automatically. Using the previous results. 


Eq. (B.3d) can be expressed as the time-derivative of another conserved quantity Ki: 


0 = 


1 d 
Ar'^ dA 


(Ar^^) ^ Ki = Ar'^cj) , ^ = 


Ar"^ 


(B.5) 


Looking at Eq. (B.16) we see that both Kq and Ki correspond to the conserved quantities that are 


obtained via the Killing vectors, cf. App. B 1 From now on the trajectory shall be parameterized 
with respect to proper time: A = r. Since Kq is then linked to infinitesimal time translations, 
it is reasonable to identify it with the total photon energy E. Furthermore Ki is connected to 
infinitesimal changes in the angle (f), which is why it corresponds to the angular momentum L. 
When these conserved quantities are compared to Eqs. (5.7a,b) in [87] we see that the energy is 


the same, but the angular momentum differs by an additional factor of A. Finally Eq. (B.3b) can 
be written as follows: 


0 = 


1 d 
2Ar dr 


- E'^A + 




^ K2 = AX'^ - E^A + 


r'^A 


(B.6) 
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Therefore the latter comprises even another conserved quantity K 2 ■ Setting K 2 = 0 is in accordance 


with the null-trajectory condition of Eq. (B.3e). Taking into account that r = (dr/d(/>)0 where (j) is 


again expressed by the conserved angular momentum, it is possible to solve Eq. (B.6) with respect 
to d(p/dr: 


d(p 

dr 


L 




r2 _ ^2 


(p(r) = L 


dr 


/ro 




(B.7) 


Comparing to Eq. (5.10) we see that C = L/E, i.e., the constant appearing in Bouguer’s formula 
can be understood as the ratio of angular momentum and total energy. 


Now there are some differences between the final result of Eq. (B.7) and the corresponding 


Eq. (5.9) in jSTj. In the latter paper a black-hole gravitational background is considered in 
Schwarzschild coordinates. This line interval does not have the form of a generally isotropic metric 


given in Eq. (5.6a). In fact, there are isotropic coordinates allowing us to write the Schwarzschild 


solution in the form (see, e.g., page 93 of my- 
~ + ■\/r(r - 2GM)^ , 


(B.8a) 


dr^ = 


l + OM/Pri ) 


Using this set of coordinates the equation encoding angular momentum conservation and the change 
in the angle (p with respect to the new radial coordinate p read as follows: 


L — 9ppP P ) 


dp L 


\/ 9pp9tt 


L 


^ ^E‘^9lpP^ - L^9pp9tt P ^E'^gjpp- 


2 -L2 


-h© 


gmV 


(B.9a) 


(B.9b) 


The first corresponds to Eq. (B.5) and the second to Eq. ( B.7| ) neglecting second-order gravity 
effects. Multiplying the modified line interval of their Eq. (4.12) by the constant vT^Pe leads to 


dT^ = 


\/l +' 


, 2GM\ , 2 /q- 

I-— j dt^ - VTTe 


I 


I - 2GM/r 


dr^ + dn^ 


(B.IO) 


Since photons move on null-trajectories, dr2 = 0 anyhow, which is why a multiplication of the 
line element by a constant should not change the physics. In this case the Lorentz-violating 
contribution governed by a position-independent e drops out of dp/dp when taking into account 


Eq. (B.9a). Therefore in the isotropic coordinates the particular Lorentz-violating contribution of 


their case 3 produces second-order gravity effects associated to Lorentz violation. Far away from 
the back-hole event horizon there are no novel physical effects and this corresponds to the outcome 
of the eikonal approach. 


1. Killing vectors of a spherically symmetric spacetime 


In the current paragraph the Killing vectors for a spherically symmetric spacetime, cf. Eq. (5.3) 


with A{r, 6, p) = A{r), will be listed. The Killing vectors describe infinitesimal isometries for a 
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spacetime and they are linked to underlying symmetries and conserved quantities. In general they 
are obtained from a set of partial differential equations called the Killing equations: 


Daii3 + DjS^a = 0 , = dy^x - ) 


(B.ll) 


with the covariant derivative Da and the Christoffel symbols ^^^x- latter can be directly 

extracted from Eq. (B.3). For the spherically symmetric spacetime it is possible to solve the 
Killing equations analytically. To do so, it is reasonable to make a certain Ansatz, e.g., one with 
vanishing spatial components of This simplifies the set of equations dramatically where several 
are immediately fulfilled automatically. They are then solved successively to obtain four Killing 
vectors. Since the metric is isotropic, it is reasonable to make an Ansatz for that only involves 
a nonvanishing timelike component that does not depend on time itself: 


(U = 


f^o{r,9,(p)\ 
0 
0 
0 


(B.12) 


In this case the following three differential equations must to be solved: 
A ^ dr ' dd ’90 


(B.13) 


The remaining ones are fulfilled automatically. The latter two tell us immediately that neither 
depends on 9 nor 0. Therefore the first differential equation is an ordinary one that can be solved 
directly by integration: 




A' 


T = —E ^ l^ol = - In \coA\ => ^o{r) = 


Co 


Co, Co G 


6 A A(r)’ 

A similar approach leads to the remaining Killing vectors. In total one obtains 


(B.14) 




0 
0 

V 0 / 




0 
0 

sin0 
\^sin 9 cos 9 cos 0y 


(B.15a) 


d^) = 


0 
0 

cos 0 
\^— sin 9 cos 9 sin 0y 


d^) = 


/ 0 \ 
0 
0 

^^sin^ 9 y 


(B.15b) 


Suitable contractions of the Killing vectors with (x^) = (t, r, 0, 0)^ (and additional linear combi¬ 
nations) lead to conserved quantities. The first conserved quantity follows from a contraction with 
the first Killing vector: 


d^d'^ = — = const. 
^ A 


(B.16a) 
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The latter corresponds to the result obtained in Eq. (B.lGa) and it is related to energy conservation. 


The second conserved quantity involves the remaining Killing vectors where it is understood to be 
evaluated at 0 = 7r/2: 


\ 




i=2 


e=7r/2 


= = const. 


(B.16b) 


This conserved quantity is the same as what was obtained in Eq. (B.5) and it means angular 
momentum conservation. Hence the Killing vectors for i = 2... 4 are related to rotational 
symmetry of the spherically symmetric spacetime. 


Appendix C: Eikonal equation for inhomogeneous and anisotropic media 

The current section serves with providing some general results on the physics of the eikonal 
equation, which are used in Sec. |V] extensively. In general, the eikonal equation provides a set 
of three coupled nonlinear differential equations. In what follows a refractive index bearing a 


dependence on the radial distance r and an angle (j) is assumed (cf., e.g., Eq. (5.23)). The photon 


trajectory shall be parameterized by the angle cj), i.e., r{4>) = r{(j))er{4>). Its first and second 
derivative read 


v' = rGr + re^f, , r" = (r — r)er + 2re,^ . 

The arc length depends on cj) and we obtain a set of useful relationships: 


K'/’) = J d0' |r'| = J d(j)' \/r2 + ^ = -y/r^ + , 


- 


d4> J 


j >‘2 I ^2 


dr dr / ds 

ds d4> Vd</> 


d / ds 
dcj) \d0 

-1 


-1 


dcj) y \/7'2 _j_ j;2 


(r + r)r 

(j.2 _|_ j;2^3/2 ’ 


(C.la) 

(C.lb) 

(C.lc) 

(C.ld) 


Now the derivative on the left-hand side of the eikonal equation can be computed. Instead of 
differentiating with respect to the arc length we have to calculate derivatives with respect to (f>, 
which leads to three terms: 


ds 


dr / ds 


n 


dcj) 


-1 


dn dr / ds 


ds d^ \d4> 


-1 


-|- n 


d^r / ds 


d(/)2 \d(?i 


-2 


d / ds 
ds yd(?i 


n 


dr 

dcj) 


dn dr /ds\ ^ d^r / ds 
dcj) dcj) Vd(/>y ^^dcjP' ydc^ 


-2 
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-1 


-1 


dr 


ds\ ' 

- I 71 - 

dcj)) dcj) 


(C.2) 
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Now employing the derivatives of Eq. (C.la) and the identities given in Eq. (C.lb) the eikonal 


equation can be expressed in terms of the radial coordinate r, the angle (j), and the basis vectors: 

dn 1 dn^ 1 f dn ^ ^ dl 

I + re^) + n[[r - rje^ + 2re^J | 

(r + r)r 


Srw I ~ 2 I -2 

or r ocp 


- n{r% + re^)- 


(C.3) 


^^2 _j_ j;2^2 

Sorting terms associated to and e^, respectively, results in a system of two differential equations: 

(r + r)f^ 


dn 

1 

dn 

dr 

ip‘2 _|_ ^^"2 

dcp 

I dn 

1 

dn 

r dcp 

_j_ 

dcp 


-f + n(f — r) 


— n- 


-r + 2nf 


— n 


(r^ _l_ j^2)2 ’ 

(r + r)rf 




(C.4a) 

(C.4b) 


Multiplying the second with r/r and subtracting it from the first eliminates various terms, which 
simplifies the equation drastically: 


n 


dn f dn 


dr d(f) 


2f2 

r-r 

r 


(C.5a) 


0 = (r^ + f2) 


9n f 9n 
r 


+ n(r^ + 2f^ 


rr). 


(C.5b) 


This is the final result that we are interested in and that shall be used for practical purposes. 
However multiplying the latter with rr/{r'^ + ^2^3/2^ written in a form that allows for a 

deeper physical understanding: 


— (nr sin a) —-\/r2~+^^ = 0 , sin a = (C.6) 

acp dcp 


If the refractive index only depends on the radial coordinate the second term on the left-hand side 
vanishes, which then leads us directly to the formula of Bouguer. Physically this result means 
angular momentum conservation. Eor a refractive index that additionally depends on the angle (p 
angular momentum is not a conserved quantity any more. Instead, there is a driving term that 
modifies the angular momentum. The change is bigger the larger the velocity is and the stronger 
the refractive index changes with the angle. 
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